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MAMMALIAN MATRIX-TYPE METALLOPROTEASE 

This filing claims priority from United States 
Application Serial No. 09/005,263, filed January 9, 1998, 
the disclosure of which is hereby fully incorporated by 
reference . 



FIELD OF THE INVENTION 
The present invention contemplates compositions 
related to proteins from animals, e.g., mammals, which 
function as proteinases. In particular, it provides 
15 nucleic acids which encode the proteinases, antibodies 
to, and proteins which exhibit biological functions, 
e.g., capacity to degrade proteinaceous substrates. 



BACKGROUND OF THE INVENTION 
The proteases are a very broad group of enzymes 
which carry out an enzymatic function of hydrolyzing a 
peptide bond. See, e.g., Beynon (ed. 1989) Proteolytic 
Enzymes: A_J £ag£jca3 Amroach IRL Press, Oxford; Methods 
in Rn^nloov vols . 244 and 248. Within the group, there 
25 is a wide range of substrate specificities for the amino 
acids adjacent the cleavage sites. Proteases are 
typically categorized on the basis of their catalytic 
mechanisms, e.g., based upon studies of their active 
sites, or by the effects of pH. Four main categories of 
30 proteases are serine proteinases, sulfhydryl proteases, 
acid proteases, and metalloproteases . They may also be 
classified according to their sites of substrate 
cleavage, e.g., endoproteases , amino peptidases, or 
carboxy peptidases . 
3 5 Proteases have traditionally held a large share of 

the industrial enzyme market. Proteases are used in many 
industrial processes, including in detergents and 
cleaning products, e.g., to degrade protein materials 
such as blood and stains, in leather production, e.g., to 
remove hair, in baking, e.g., to break down glutens, in 
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flavorings, e.g., soy sauce, in meat tenderizing, e.g., 
to break down collagen, in gelatin or food supplement 
production, in the textile industry, in waste treatment, 
and in the photographic industry. See, e.g., Gusek 
5 (1991) Inform 1:14-18; Zamost, et al . (1996) J. 

Industrial Microbiol . 8:71-82; James and Simpson ( 1996 ) 
CRC Critical Reviews in Food Science and Nutrition 
36:437-463; Teichgraeber , et al . (1993) Trends in Food 
Science and Technology 4:145-149; Tjwan, et al . (1993) 

10 Dairy Research 60:269-286; Haard (1992) J. Aquatic Food 
Product Technology 1:17-35; van Dijk (1995) Laundry and 
Cleaning News 21:32-33; Nolte, et al . (1996) J. Textile 
Institute 87:212-226: Chikkodi , et al . (1995) Textile 
Res. J. 65 : 564-569; ■ and Shih (1993) Poultry: Science 

15 72:1617-1620. 

Matrix metalloproteinases (MMPs) are a family of 
enzymes whose main physiological function is degradation 
of the extracellular matrix. See, e.g., Parsons, et al. 
(1997) Br.' J. Surgery 84:160-166. These enzymes are 

20 present in normal healthy individuals and have been shown 
to have an important role in processes such as wound 
healing (see Wolf, et al . (1992) J. Invest. Dermatol. 
99:870-872; and Wysocki, et al . (1993) J. Invest. 
Dermatol . 101:64-68), pregnancy and parturition (see 

25 Jeffrey (1991) Seminars Perinatol . 15:118-126), bone 

resorption (see Delaisse and Vaes, pp. 290-314 in Rifkin 
and Gay (eds. 1992) Biology and Physiology of the 
Osteoclast CRC Press, Ratan, FL) , and mammary involution 
(Talhouk, et al . (1992) J. Cell Biol. 118:1271-1282). 

30 See also Nagase (1996) in Hooper (ed. ) Zinc 

Metalloproteinases in Health and Disease Taylor and 
Francis, London. A recent focus on the MMPs is on their 
role in certain disease states in which breakdown of the 
extracellular matrix is a key feature, e.g., in 

35 rheumatoid arthritis (see Harris (1990) NEJ Med. 

322:1277-1289), periodontal disease (see Page (1991) J . 
Periodont. Res. 26:230-242), and cancer (see Brown (1997) 
Medical Oncology 14:1-10; Chambers and Matisian (1997) J . 
NCI 89:1260-1270; Yu, et al . (1997) Drugs and Aaina 
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11:229-244; Yu, et al . (1997) Clinical Pharmacology 
11:229-244; Wo j towicz-Praga , et al. (1997) Invest. New 
Druqs 15:61-75; Coussens and Werb (1996) Chem. Biol. 
3:895-904; and Talbot and Brown (1996) F.ur. J. Cancer 

5 32A:2528-2533) - 

While there are many uses for proteases, there is 
always the need for a more active or specific protease 
under various specific conditions. Alternatively, the 
distribution of these gene products may be useful as 
10 markers for specific cell or tissue types. There is 
need for new proteinases of differing properties, 
specificities, and activities. 
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SUMMARY OF THE INVENTION 
In a search for DC restricted molecules, a novel 
member of the MMP family of proteolytic enzymes was 
identified which belongs to the Membrane-type Matrix 
Metalloproteinase (MT-MMP) subclass. This fifth MT -MMP 
proteinase, located on chromosome 16pl3.3, is present in 
spleen, lymph node, thymus, appendix, PBL, and bone 
marrow, and strongly expressed by DC and weakly by 
granulocytes and effector T cells. Interestingly, the 
mRNA expression of this gene is. down-regulated by CD40L 
activation of CD34+- and monocyte -derived DC. According 
25 to its cellular expression and putative membrane 
localization, a role is proposed for this novel 
Membrane-type Matrix Metalloproteinase gene in 
degradation of the extracellular matrix during DC 
migration . 

30 The present invention provides a binding compound 

comprising an antibody binding site which specifically 
binds to primate F06B09 protein; a nucleic acid 
comprising sequence encoding at least 12 amino acids of 
primate F06B09 protein; a substantially pure protein 
35 which is specifically recognized by the above antibody 

binding site; a substantially pure primate F06B09 protein 
or peptide thereof; and a fusion protein comprising a 3 0 
amino acid sequence portion of primate F06B09 protein 
sequence. 
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In certain binding compound embodiments, the 
antibody binding site is specifically immunoreactive with 
a protein selected from polypeptides of SEQ ID NO: 4; is 
raised against a purified or recombinantly produced 
5 primate F06B09 protein; is immunoselected on a 

substantially purified or recombinantly produced primate 
F06B09 protein; is in a monoclonal antibody, Fab, or 
F(ab)2; is detectably labeled; is attached to a solid 
substrate; is from a rabbit or mouse; binds with a Kd of 
10 at least about 3 00 JIM; is fused to another protein 

segment; is in a chimeric antibody; or is coupled to 
another chemical moiety. 

The invention also provides a method of making an 
antigen-antibody complex, comprising a step of contacting 
15 a primate biological sample to a specific binding 

antibody described. In preferred embodiments, the method 
further includes steps to purify the antigen or antibody. 

Alternative embodiments provide an antibody binding 
site wherein the binding site is detected in a biological 

2 0 sample by a method comprising the steps of contacting a 

binding agent having an affinity for F06B09 protein with 
the biological sample; incubating the binding agent with 
the biological sample to form a binding agent :F06B09 
protein complex; and detecting the complex. In certain 
25 embodiments, the biological sample is human, and the 
binding agent is an antibody. 

The invention also provides kits containing a 
composition described above and instructional material 
for the use of the composition; or segregation of the 

3 0 composition into a container. Typically, the kit is used 

to make a qualitative or quantitative analysis. 

The invention also embraces a cell comprising an 
antibody described above; a cell transfected with a 
nucleic acid described above; or a cell comprising a 
3 5 fusion protein described above. 

In nucleic acid embodiments, the nucleic acid may 
encode a polypeptide which specifically binds an antibody 
generated against an immunogen selected from the group 
consisting of the mature polypeptides of SEQ ID NO: 4. 
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Alternatively, the nucleic acid may encode at least 12 
amino acids of SEQ ID NO: 4; comprise sequence of at 
least about 3 9 nucleotides selected from protein coding 
portions of SEQ ID NO: 1 or 3; hybridize to SEQ ID NO: 1 
5 or 3 under stringent wash conditions of at least 45° C 
and less than about 150 mM salt; comprise sequence made 
by a synthetic method; be an expression vector; be 
detectably labeled; be attached to a solid substrate; be 
from human; bind with a Kd of at least about 300 pM; be 
10 fused to another nucleic acid segment; be coupled to 
another chemical moiety; be operably associated with 
promoter, ribosome binding site, or poly-A addition site; 
be a PCR product; be transformed into a cell, including a 
bacterial cell; be in a sterile composition; be capable 
15 of selectively hybridizing to a nucleic acid encoding an 
F06B09 protein; comprise a natural sequence; comprise a 
mature- protein coding segment of SEQ ID NO: 1 or 3 ; 
encode proteolytically active portion of F06B09; be 
detected in a biological sample by a method comprising: 
20 contacting a biological sample with a nucleic acid probe 
capable of selectively hybridizing to said nucleic acid, 
incubating the nucleic acid probe with the biological 
sample to form a hybrid of the nucleic acid probe with 
complementary nucleic acid sequences present in the 
25 biological sample; and determining the extent of 
hybridization of the nucleic acid probe to the 
complementary nucleic acid sequences, including the 
method where the nucleic acid probe is capable of 
hybridizing to a nucleic acid encoding a protein selected 
30 from the group consisting of the mature polypeptides of 
SEQ ID NO 4. 

in protein or polypeptide embodiments, the proteins 
may bind with a Kd of at least about 300 to an 
antibody generated against an immunogen of the 
3 5 polypeptides of SEQ ID NO: 4; be immunoselected on an 

antibody which selectively binds a polypeptide of SEQ ID 
NO: 4; comprise sequence of at least 12 contiguous 
residues of SEQ ID NO: 4; exhibit a post-translational 
modification pattern distinct from natural F06B09; be 3- 
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fold or fewer substituted from natural sequence; be 
recombinantly produced; be denatured; have sequence of 
full length natural polypeptide; be detectably labeled; 
be attached to a solid substrate; be from human; be in a 
5 sterile composition; be fused to another protein segment ,- 
be coupled to another chemical moiety; comprise at least 
a fragment of at least 3 2 amino acid residues from a 
human F06B09 protein; comprise mature polypeptide 
sequence selected from the group consisting of SEQ ID NO 
10 2 and 4; be a soluble protein; be a naturally occurring' 
protein; or be a proteolytically active portion of 
F06B09. 

The invention also provides an isolated protein 
which specifically binds to an antibody generated against 
15 an iitimunogen selected from the group consisting of the 
full length polypeptides of SEQ ID NO: 4. Preferably 
such protein binds to the antibody with a Kd of at least 
about 3 00 |HM; is immunoselected on an antibody which 

selectively binds a polypeptide of SEQ ID NO: 4; 

2 0 comprises sequence of at least 12 contiguous residues of 

SEQ ID NO: 4; exhibits a post-translational modification 
pattern distinct from natural F06B09; is 3-fold or fewer 
substituted from natural sequence; is recombinantly 
produced; is denatured; has sequence of full length 
25 natural polypeptide; is detectably labeled; is attached 
to a solid substrate; is from human; is in a sterile 
composition; is fused to another protein segment; is 
coupled to another chemical moiety; comprises at least a 
fragment of at least 3 2 amino acid residues from a human 

3 0 F06B09 protein; comprises mature polypeptide sequence 

selected from the group consisting of SEQ ID NO 4; is a 
soluble protein; or comprises a proteolytic activity of 
F06B09. 

In certain other embodiments, the invention embraces 
3 5 a fusion protein described above, which comprises 

sequence from an enzymatically active portion of SEQ ID 
NO.: 4 . Preferably such protein binds with a Kd of at 
least about 3 00 pM to an antibody generated against an 

immunogen having sequence of a polypeptide of SEQ ID NO. 
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4; is immunos elected on an antibody which selectively 
binds a polypeptide of SEQ ID NO: 4; comprises sequence 
of at least 12 contiguous residues of SEQ ID NO: 4; is 
recombinants produced; is denatured; has sequence of 
full length natural polypeptide; is detectably labeled; 
is attached to a solid substrate; comprises sequence from 
human; is in a sterile composition; is fused to another 
protein segment; is coupled to another chemical moiety; 
comprises at least a fragment of at least 32 amino acid 
0 residues from a human F06B09 protein; comprises mature 
polypeptide sequence from SEQ ID NO 4; is a soluble 
protein; or comprises a proteolytic activity of F06B09 . 

The invention also provides a substantially pure 
protein described above which comprises a proteolytic 

5 activity of F06B09 . 

A method of modulating physiology or development of 
a cell comprising contacting said cell with said 
compositions is provided. 

0 DETAILED DESCRIPTION 

I . General 

Dendritic cells (DC) , present in all lymphoid and 
non lymphoid organs, are professional antigen presenting 
cells (APC) which have the unique capacity to activate 
>5 naive T cells . See, e.g., Banchereau and Steinman (1998) 
Nature 392:245-252; and Steinman ( 1991) Rev. 
Tmmunol. 9:271-296. DC, originated from bone-marrow, 
migrate as precursors through bloodstream to non lymphoid 
tissues where, at immature stage, DC such as the 
30 epidermal Langerhans cells capture antigens with high 
efficiency and become circulating veiled cells. These 
cells bearing antigens migrate from the peripheral non 
lymphoid tissues via lymphatics or bloodstream into 
lymphoid tissues where they localize in T cell-rich areas 
35 as mature interdigitating DC (IDC). See, e.g., Austyn 

(1996) .t Exek Med. 183:1287-1292; Austyn, et al . (1988) 
,T. F. m . Med. 167:646-651; Fossum (1988) Scand. J. 
Tmmunol . 27:97-105; Hoefsmit, et al . (1982) 
T^mnWoloav . 161:255-265; Kripke, et al . (1990) J, 
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Immunol. 145:2833-2838; Larsen, et al . (1990) J, B>m _ 
Meek. 172:1483-1494; Macatonia, et al . (1987) J. Eyn. 
166:1654-1667; Roman i , et al . (1989) J. Exp. M^ri 
169:1169-1178. At this site, IDC efficiently present 
5 processed Ags to naive T cells and generate a specific 
immune response. See, e.g., Inaba, et al . (1983). Proc . 
Natl . Acad. Sci ; USA . 80:6041-6045; and Inaba and 
Steinman (1985) Science. 229:475-479. Thus, migration 
constitutes an integral part of DC function. 

10 The recruitment of DC into a site of tissue damage 

and the subsequent migration of DC into secondary 
lymphoid organs is dependent upon a dynamic and complex 
series of events, including activation by inflammatory 
stimuli. See, e.g., Butcher (1991) Cell . 67:1033-1036. 

15 This mechanism, implies transendothelial migration beyond 
the vascular compartment involving the expression of 
integrin molecules, the movement along leukocyte specific 
chemotactic gradients (Taub (1996) Cytokine Growth Factor 
Rev. 7:355-376) and possibly the secretion of matrix- 

20 degrading enzymes (Watanabe, et al . (1993) J. Cell SrH . 
104:991-999). In addition, the trafficking of DC into 
tissues involves breaching the basement membrane (dermo- 
epidermic junction) , which would necessitate the 
production of a matrix -degrading enzyme. 

25 Matrix metalloproteinases , or matrixins , represent a 

group of structurally related zinc -dependent 
endopeptidases that are involved in extracellular matrix 
and basement membrane degradation and cell-matrix 
interactions. See, e.g., Basbaum and Werb (1996) Curr . 

30 Ooin. C ell Biol. 8:731-73 8; Birkedal-Hansen, et al. 

(±993) Crit. Rev. Oral Biol. Med. 4:197-250; Mignatti and 
Rifkin (1993) Phvsiol. Rev. 73:161-195; and Stetler- 
Stevenson, et al . (1993) Annu. Rev. Cell Biol . 9:541-573. 
They play crucial roles in tissue remodeling in normal 

3 5 and pathological processes, including development, 

repair, and cancer progression. All MMPs identified to 
date are synthesized as an inactive proenzyme form or 
zymogen, contain zinc-binding sites, and need proteolytic 
activation to become functional proteases. According to 
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their structural features and substrate specificity, four 
subclasses of MMPs have been established: collagenases 
have the unique capacity to degrade fibrillar collagens; 
gelatinases denature basement membranes and denatured 
5 collagens; stromelysins. degrade many extracellular 
proteins, including proteoglycans, laminin, and 
fibronectin; and membrane-type MMPs are supposed to have 
proteolytic activity on other MMPs, required for their 
activation. See Matrisian (1992) Bioessays. 14:455-463; 
10 Stetler-Stevenson, et al . (1993) Pnrni F-v Cell Biol. 
9:541-573; and Woes sner (1991) FASEB J. 5:2145-2154. 
Among the 15 identified MMPs, four distinct members, 
presenting a transmembrane domain at the C terminus, have 
been described and are referred to as MT-MMP (Membrane- 
15 type matrix metalloproteinase) . Sato, et al . (1994) 

Nature 370:61-65 first identified the MT1-MMP (MMP 14), 
responsible for the activation of progelatinase A (pro- 
MMP2) on the tumor cell surface that may trigger tissue 
invasion by tumor cells . Three additional members of 
20 this family, MT2-MMP (MMP 15), MT3-MMP (MMP 16) and 

MT4-MMP (MMP 17) have been isolated respectively from 
lung, placenta, and breast carcinoma cDNA libraries. See 
Puente, et al . (1996) CVmcor Res . 56:944-949; Takino, et 
al (1995) J BioJ - Chem. 270:23013-23020; and Will and 
25 Hinzmann (1995) Eur ,T Biochem. 231:602-608. All 
MT-MMPs, like the stromelysin-3 MMP (Basset, et al . 
(1990) Nature 348:699-704), contain a consensus insertion 
of about ten amino acids (RxK/RR) between the propeptide 
and the catalytic domain, corresponding to potential 
30 cleavage sites by enzymes called furin (Basbaum and Werb 
(1996) CUXE QBiB ££13 8:731-8; Sang and Douglas 

(1996) T Er^iB Chem. 15:137-160). This cleavage xs 
necessary to give rise to an activated form of MT-MMPs . 
The MT-MMPs are located in ternary complexes including a 
35 substrate, a tissue inhibitor of MMPs (TIMPs) , and an 
activated MT-MMP, associated with the plasma membrane 
(Stetler-Stevenson, et al . (1993) Annn Rev CaU Biol. 
9-541-573). As described for MT1-, MT2-, and MT3 -MMPs , 
MT-MMPs may have a proteolytic activity on other MMPs 



WO 99/35276 



10 



PCT/US98/26214 



like gelatinase A (pro-MMP2) and collagenase-3 (MMP 13) 
(see Butler, et al . (1997) Eur. J. Bioch^m . 244:653-657; 
Knauper, et al . (1996) J. Biol. Chem. 271:17124-17131; 
Kolkenbrock, et al . (1997) Biol. Chem. 378:71-76; Sato, 
5 et al. (1994) Nature 370:61-65; Strongin, et al . (1993) 
J. Biol. Chem. 268:14033-14039; and Takino, et al. (1995) 
J. Biol. Chem. 270:23013-23020). 

The present invention provides DNA sequences 
encoding mammalian proteins which exhibit structural 

10 properties or motifs characteristic of a protease, more 
particularly a matrix metalloproteinase. The proteins 
described herein are designated F06B09 . A human F06B09 
nucleotide and predicted amino acid sequence are provided 
in SEQ ID NO: 1 and 2. The signal sequence /cleavage is 

15 prediced to be at nucleotide 344. The predicted 

extracellular domain is expected at about 1-527; the 
transmembrane segment occurs at about 52 8-543; the 
cytoplasmic domain is found at about 544-545. 

An alternative sequence of F06B09 is provided in SEQ 

20 ID NO: 3 and 4. Notable motifs include: predicted signal 
sequence until about nucleotide 343; propeptide domain 
from about 1-66; C switch motif from about 67-73; furin 
site from about 82-86; catalytic site from about 87-211; 
zinc binding site from about 212-222, with notable His at 

25 212, 216, and 222; hinge region from about 260-290; 

hemopexin-like domain from about "291-525 ; transmembrane 
segment from about 52 6-53 8; and cytoplasmic tail from 
about 539-542 . 

Through an effort aiming at the identification of 

3 0 human dendritic cells (DC) specific genes, the cDNA 
coding for a fifth member of the human Membrane-type 
Matrix Metalloproteinases (MT-MMP) family has been 
cloned. The full-length 3 691 bp cDNA which was mapped on 
chromosome 16pl3.3, contains an open reading frame of 

35 some 1689 bp, encoding a 562 amino acid protein. The 
predicted protein was most homologous (48% amino acid 
homology) with the human matrix metalloproteinase MT4-MMP 
and has the typical features of member of the MMP family, 
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including a prodomain with the activation locus, the zinc 
binding site, and the hemopexin domain. 

The general roles of matrix metalloproteases are 
described above. The specific interaction of matrix 
5 metalloproteinases with other proteins, e.g., furin and 
progelatinase, are described in Basbaum and Werb (1996) 
Qurj^ Omn ion is QeJJ Bj oloav 8:731-73 8. Matrix 
metalloproteases are typically zinc endopeptidases that 
are required for the degradation of extracellular matrix 
10 components during normal embryo development, 

morphogenesis, and tissue remodeling. Their proteolytic 
activities are precisely regulated by endogenous tissue 
inhibitors of metalloproteases (TIMPS) . Disruption of 
this balance results in diseases such as arthritis, 
15 atherosclerosis, and tumor growth and metastasis . Nagase 
(1996) in Hooper (ed. ) £inc, MP.tallonrnf.einases in Health 
ang Disease Taylor and Francis, London; Coussens and Werb 
(1996) £heiL Biol. 3:895-904. Therefore, F06B09 gene 
product could play a role in the migration of the 
20 dendritic cells (DC) or in the progression of the 

dendrites between the stromal cells. The way the MMPs 
act on the matrix is complex. The MMP is typically 
produced as an inactive proenzyme that needs to be 
processed by another protease, most probably furin, since.. 
25 F06B09 contains a site of cleavage for this convertase. 

This process probably occurs intracellular^ (Basbaum and 
Werb (1996) Current DjD j nioi] in ^11 Biology 8 : 731-738 ) , 
and is likely followed by an interaction of F06B09 with 
other proteases like progelatinase. 
30 A single 3.7 Kb mRNA transcript of this gene was 

found to be mainly expressed in CD34+-derived human DC 
and also weakly in in vitro generated granulocytes . No 
signal was detected in TFl, CHA, Jurkat, MRC5, or U937 
cell lines, nor in freshly isolated monocytes , actxvated 
3 5 T and B cells, and activated peripheral blood lymphocytes 
(PBLs) . Among normal adult human tissues, this mRNA was 
detected in spleen, lymph node, thymus, appendix, and 
bone marrow, but no expression was found in fetal 
tissues. RT-PCR distribution analysis showed a 
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significant expression of the novel MT-MMP in activated 
DC and weakly in JY B cell line. Interestingly, it was 
found that the novel MT-MMP iaRNA expression was down- 
regulated upon DC activation with CD40L. The expression 
5 pattern of this gene, which is predominantly expressed by 
DC, together with its putative membrane localization, 
suggest that it could be involved in the degradation of 
the extracellular matrix during DC migration. 

The descriptions below are directed, for exemplary 

10 purposes, to primate embodiments, e.g., human, but are 
likewise applicable to related embodiments from other, 
e.g., natural, sources. Other ESTs have been identified 
from rodent cDNA libraries . These sources should, where 
appropriate, include various vertebrates, typically warm 

15 blooded animals, e.g., birds and mainmals, particularly 
domestic animals, and primates. The sequences exhibit 
significant similarity to membrane- type matrix 
metalloproteases MT-MMP 1 to 4. Table 1 shows an 
alignment of the family members. 
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Table 1- Comparison of various MMPs with F06B09. MT4-MMP (SEQ ID 
NO- 7) is from Genbank 3466295 (EMBL X89576); MT2-MMP (SEQ ID NO: 
8) is from Genbank 1418215 (DDBJ D86331; see also Genbank 3646295 
(EMBL Z48482); MT1-MMP (SEQ ID NO: 9) is from Genbank 804994 { EMBL 
X83535; see also Genbank 1495995 (EMBL X90925) and 793763 (DDBJ 
D26512); and MT3-MMP (SEQ ID NO: 10) is from Genbank 2424979 (DDBJ 
D85511). 



10 



MT4-MMP 
F06B09 
MT2 -MMP 
MT1-MMP 
MT3 -MMP 



1 u 

1 MRLRLRLLALLLLLLAPPARAPKPSAQDVSLGVDWLTRY 3 9 

1 0 

1 MSPAPRPSRCLLLPLLTLGTALASLGSAQSSSFSPEAWLQQY 42 

1 MILLTFSTGRRLDFVHHSGVFFLQTLLWILCATVCGTEQYFNVEVWLQKY 50 



15 



20 



MT4-MMP 
F06B09 
MT2 -MMP 
MT1-MMP 
MT3 -MMP 



1 MQQFGGLEATGILDEATLALMKTPR 25 

40 GYLPPPHPAQAQLQSPEKLRDAIKVMQRFAGLPETGRMDPGTVATMRKPR 89 

x MKRPR 5 

43 GYLPPGDLRTHTQRSPQSLSAAIAAMQKFYGLQVTGKADADTMKAMRRPR 92 
51 GYLPPTDPRMSVLRSAETMQSALAAMQQFYGINMTGKVDRNTIDWMKKPR 100 



MT4-MMP 26 CSLPDLP-VLTQARRR— RQ- -APAPTKWNKRNLSWRVRTFPRDSPLGHD 70 

F06B09 90 CSLPDVL-GVAGLVRR-- RRRYALSGSVWKKRTLTWRVRSFPQSSQLSQE 136 

2 5 MT2 -MMP 6 CGVPDQFGVRVKANLRRRRKRY ALTGRKWNNHHLT F S I QNYT - - EKLGWY 5 3 

MT1-MMP 93 CGVPDKFG AE I KANVR- - RKRYAI QGLKWQHNE I T FC I QNYT - - PKVGE Y 138 

MT3-MMP 101 CGVPDQTRGSSKFHIR— RKRYALTGQKWQHKHITYSIKNVT — PKVGDP 146 
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35 



40 



MT4-MMP 

F06B09 

MT2-MMP 

MT1-MMP 

MT3-MMP 



MT4-MMP 

F06B09 

MT2-MMP 

MT1-MMP 

MT3-MMP 



71 TVRALMYYALKVWSDI APLNFHEVA- - -GS TADIQIDFSKADHND 112 

137 TVRVLMSYALMAWGMESGLTFHEVDSPQGQ EPDILIDFARAFHQD 181 

54 HSMEAVRRAFRVWEQATPLVFQEVPYEDIRLRRQKEADIMVLFASGFHGD 103 
13 9 ATYEAIRKAFRVWESATPLRFREVPYAYIREGHEKQADIMIFFAEGFHGD 188 
147 ETRKAIRRAFDVWQNVTPLTFEEVPYSELENGK-RDVDITIIFASGFHGD 19 5 
* ^ ★*'★★ **.*. * * 

113 GYPFDGPGGTVAHAFFPGHHHTAGDTHFDDDEAWTFRSSDAHGMDLFAVA 162 
182 SYPFDGLGGTLAHAFFPGEHPISGDTHFDDEETWTFGSKDGEGTDLFAVA 231 
104 SS PFDGTGGFLAHAYFPGPG - LGGDTHFDADEPWTF SSTDLHGNNLFLVA 152 
189 STPFDGEGGFLAHAYFPGPN-IGGDTHFDSAEPWTVRNEDLNGNDIFLVA 237 
19 6 SSPFDGEGGFLAHAYFPGPG-IGGDTHFDSDEP'WTLGNPNHDGNDLFLVA 2 44 



* * * * * ★ 
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50 



55 



MT4-MMP 
F06B09 
MT2 -MMP 
MT1-MMP 
MT3-MMP 



MT4 -MMP 
F06B09 
MT2-MMP 
MT1-MMP 
MT3 -MMP 



163 VHEFGHAIGLSHVAAAHSIMRPYYQGPVGDPLRYGLPYEDKVRVWQLYGV 212 
232 VHEFGHALGLGHSSAPNSIMRPFYQGPVGDPDKYRLSQDDRDGLQQLYG- 2 80 
153 VHELGHALGLEHSSNPNAIMAPFYQWKDVDN— FKLPEDDLRGIQQLYGT 200 
238 VHELGHALGLEHS SDP SAIMAPFYQWMDTEN — FVLPDDDRRGI QQLYGG 285 
245 VHELGHALGLEH SNDPT AIMA P F YQ YMETDN - - FKL PNDDLQG I QKI YGP 2 92 
*** ** * .** *..** . . * -* - .-** 



213 RESVSPTAQ — PEEPPLLP- 



EP PDNRSSA 23 8 

2 81 KAPQTPYDK- -PTRKPLAP PPQ PPASPTH 3 07 

201 PDGQPQPTQPLPTVTPRRPG RPDHRPPRPPQPPPPGGKPERPPKP 2 45 

2 86 ESG FPTKMPPQP RTTSRP SVPDKPKNP 312 

293 PDKIPPPTRPLPTVPPHRSIPPADPRKNDRP KPPRPPTG 331 
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MT4-MMP 23 9 PPR -KDVPHRCSTHFDAVAQIRGEAFFFKGKYFWRLTRDRH 278 

F06B09 308 SPS FPIPDRCEGNFDAIANIRGETFFFKGPWFWRLQPSGQ 347 

MT2-MMP 246 G P P VQ P RAT ERP DQ YG PNI C DGD FDTVAMLRG EMFVF KGRWF WRVRHNR - 294 

MT1-MMP 313 TYGPNICDGNFDTVAMLRGEMFVFKERWFWRVRNNQ- 3 48 

MT3-MMP 3 32 RPS YPGAKPNICDGNFNTLAIL.RREMFVFKDQWFWRVRNNR- 3 72 

* * * * * ★ * ★ * ★** 



10 



15 



20 



MT4-MMP 27 9 LVSLQPAQMHRFWRGLPLHLDSVDAVYERTSDHKIVFFKGDRYXVVPKDNN 32 8 

F0 6B09 348 LVSPRPARLHRFWEGLPAQVRWQAAYARHRDGRILLFSGPQFWVFQDR- 3 96 

MT2 -MMP 29 5 VLDNY PMP I GHFWRGL PGD I S AAYERQ-DGRFVFFKGDRYWLFREAN 3 40 

MT1-MMP 349 VMDG Y PMP I GQFWRGL PAS IN TAYERK - DGKFVFFKGDKHWVFDEAS 394 

MT3-MMP 373 VMDG Y PMQ I T YFWRGL P P S I D AVYEN S - DGNFVF FKGNKYWFKDTT 418 

MT4-MMP 329 VEEGYPRPVSDFSLP- -PGG-IDAAFSWAHNDRTYFFKDQLYWRYDDHTR 375 

F06B09 397 QLEGGARPLTELGLP- - PGEEVDAVFSWPQNGKTYLVRGRQYWRYDEAAA 444 

MT2-MMP 3 41 LEPGYPQPLTSYGL-GIPYDRIDTAIWWEPTGHTFFFQEDRYWRFNEETQ 3 89 

MT1-MMP 395 LEPGYPKHIKELGR-GLPTDKIDAALFWMPNGKTYFFRGNKYYRFNEELR 443 

MT3-MMP 419 LQPGYPHDLITLGS-GIPPHGIDSAIWWEDVGKTYFFKGDRYWRYSEEMK 467 



MT4-MMP 376 HMDPGYPAQSPLWRGVPSTLDDAMRWS-DGASYFFRGQEYWKVLDGELEV 424 

F06B09 445 RPDPGYPRDLSLWEGAPPSPDDVTVSN-AGDTYFFKGAHYWRFPKNSIKT 493 

25 MT2-MMP 3 90 RGD PG Y P K P I S VWQG I PAS P KGAF L SNDAA YT YF YKGTKYWKFDNERLRM 43 9 

MT1-MMP 444 AVDS E YP KN I KVWEG I PE S PRG S FMG S DEVF T YF YKGNKYWKFNNQ KLKV 49 3 

MT3-MMP 468 TMDPGYPKPITVW:GIPESPQGAFVHKENGFTY?YKGKEYWKFNNQILKV 517 



30 



35 



MT4-MMP 42 5 APG Y PQ S TARDWLVCGD S QADGS VAAGV DAAEGPRAPPGQHDQ 4 67 

F06B09 49 4 EPDAPQPMGPNWLDCPAP : SSGPRAP RP 52 0 

MT2-MMP 440 EPGYPKSILRDFMGCQEHVEPGPRWPDVARPPFNPHGGAEPGADSAEGDV 489 

MT1-MMP 49 4 EPGYPKSALRDWMGCPSGGRPDE GTEEETEVIIIEV 529 

MT3-MMP 518 EPGYPRSILKDFMGCDGPTDRVKEG HSPPDDVDIVIKL 555 



40 



MT4-MMP . 468 SRS EDG YEVCSCT SGAS S P PGAPG PLVAAT 497 

F06B09 521 PKA TPVSETCDCQCELN QAAGRWPAP I 547 

MT2-MMP 49 0 GIXSDGDFGAGVNKDRGSRVWQMEEVARTVNVVMVLVPLLLLLCVLGL^ 53 9 

MT1-MMP 53 0 D EEGGGAVSAAAWLPVLLLLLVLAVGL 557 

MT3-MMP 556 DN *■ T AS TVKAI A IVI PC I LAL»C LLVLVY 582 



MT4-MMP 498 MLLLLP- PLSPGALWTAAQALTL 519 

45 F06B09 548 PLLLLP-LLVGGVASR 562 

MT2-MMP 540 ALVQMQRKGAPRVLLYCKRSLQEWV 564 

MT1-MMP 558 AVFFFRRHGTPRRLLYCQRSLLDKV 5 82 

MT3-MMP 583 TVFQFKRKGTPRHILYCKRSMQEWV 607 
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■ in the search for DC specific genes, a novel matrix- 
metalloproteinase homologue (MMP) from the Membrane-type 
Matrix Metalloproteinases (MT-MMP) family subclass was 
identified. Of interest, this novel gene designated 
F06B09 is predominantly expressed by both CD34+- and 
monocyte-derived DC and is down-regulated after DC 
maturation by CD40L L cells. The deduced protein 
sequence of F06B09 is clearly a member of the MMP family, 
characterized by the presence of a prodomain with the 
) activation locus containing the essential cysteine 

residue, a catalytic domain including the zinc-binding 
site with the consensus sequence HExGHxxxxxH and an 
hemopexin-like domain (Birkedal-Hansen (1995) Curr . Ooin. 
QsU Biol. 7:728-735; Shapiro (1998) mrr . Opin. Cell 
15 Biol. 10:602-608) . MMPs belonging to the metzincin (or 
Clan) superfamily, can be classified into at least four 
subfamilies of closely related members: collagenases , 
stromelysins, gelatinases, and MT-MMP, although there are 
some MMPs like the macrophage metalloelastase (Belaaoua D , 
et al (1995) q BjoX Chem. 270:14568-14575) and the 
stromelysin 3 (Basset, et al . (1990) Nature 348 : 699-704) 
that do not belong to these subclasses. According to its 
structural characteristic and its high level of homology 
with MT4-MMP (Puente, et al . (1996) Cancer Res. 56:944- 
25 949), F06B09 represents the fifth member of the MT-MMP 
subclass . All MT-MMPs present a putative transmembrane 
domain in the C-terminal portion and a characteristic 
insertion between the propeptide and the catalytic domain 
containing the consensus amino acid sequence RxK/RR 
30 (Basbaum and Werb (1996) mrr Onin. Cell Biol. 8:731- 

738) in order to activate MT-MMPs, this site is cleaved 
by enzymes called furin. Like other MMPs or matrixins, 
the novel MT-MMP contains noncatalytic domains, in 
addition to the protease domain, which are likely 
3 5 involved in interactions with substrates or other 

proteins. MT-MMPs have the ability to cleave substrates, 
e g other MMPs. Effectively, MT1-, MT2- and MT3-MMP 
can activate proMMP2 (progelatinase A) into MMP2 on the 
cell surface (Butler, et al . (1997) Knr .T , Biochem. 
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244:653-657; Knauper, et al . (1996) J. Biol. Chem. - 
271:17124-17131; Kolkenbrock, et al . (1997) Biol. Ch^m . 
378:71-76; Sato, et al . (1994) Nature 370:61-65; 
Strongin, et al.'(1993) J. Biol. Chem. 268:14033-14039; 
5 and Takino, et al - (1995) J . Biol. Chem. 270:23013- 
23020), which as an active form degrades type IV- 
collagen, the major component of basement membranes 
(Wilhelm, et al . (1989) J . Biol. Chem. 264:17213-17221). 
Thus, active MMP-2 (gelatinase A) plays a key role in the 

10 invasion of migrating cells into tissues. Several 
reports demonstrate that an overexpression of MMPs , 
especially MMP-2 and MMP-9, is associated with the 
invasive behavior of tumor cells (Shapiro (1998) Curr . 
Ooin. Cell Biol. 10:602-608; Stetler-Stevenson, et al. 

15 (1993) Annu. Rev. Cell Biol. 9:541-573). Concerning the 
MT4-MMP, the most homologous gene to F06B09, it is 
uncertain whether this MT-MMP can activate proMMP-2 . 
Phylogenet-ic analysis shows that among the MT-MMPs, 
MT4-MMP and F06B09 are distinguished from others and form 

2 0 a group of closely related proteins. It is speculated 

that these two MT-MMP share similar biological 
activities. It has been shown than MT-MMPs are 
overexpressed in cancers; in particular, high levels of 
MT1-MMP are associated with invasiveness of cervical 

25 cancer cells (Gilles, et al . (1996) Int. J. Cancer . 

65:209-213), breast, colon, neck'and lung carcinomas 
(Okada, et al . (1995) Proc . Natl. Acad. Sci . USA . 
92:2730-2734; Sato, et al . (1994) Nature 370:61-65; Ueno, 
et al. (1997) Cancer Res. 57:2055-2060) and gastric 

30 cancers (Mori, et al . (1997) Int. J. Cancer . 74:316-321). 
Furthermore, MT4-MMP was isolated from a breast carcinoma 
(Puente, et al . (1996) Cancer Res. 56:944-949). 
Interestingly, the novel MT-MMP identified in the present 
report constitutes the first MT-MMP isolated from a DC 

3 5 library. It will be interesting to further study F0 6B09 

expression in pathological tissues. 

MT-MMP participate not only to the control of cell 
migration and tissue remodeling through their involvement 
in degradation of extracellular matrix and, basement 
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membranes, but also in the processing of pro-enzymes and 
pro -cytokines . 

Of note, the genes for human MT1-MMP, 2, and 3 have 
been localized on three different chromosomes by in situ 
5 hybridization (Mattei, et al . (1997) Genomics . 40:168- 
169; Mignon, et al . (1995) Genomics. 28:360-361),. and 
while the novel MT-MMP gene is on the same chromosome as 
MT2-MMP, both genes are on different loci: MT2-MMP is on 
chromosome 16ql2 (Mattel, et al . (1997) Genomics . 40:168- 
10 169; Yasumitsu, et al . (1997) DNA Res. 4:77-79), and the 
novel MT-MMP is on chromosome 16pl3.3. 

The predominant expression of F06B09 in immature DC, 
and its putative membrane localization, suggests a role 
for this MT-MMP in degradation of extracellular matrix 
15 during DC migration. Furthermore, like MT1-MMP, which 
may trigger invasion by tumor cells by activating 
* progelatinase A on tumor cell surface, this novel MT-MMP 
could be involved in cancer invasion. 

The proteins of this invention are defined in part 

2 0 by their sequences, and by their physicochemical and 

biological properties. The biological properties of the 
human proteases described herein, e.g., human F06B09, are 
defined by their amino acid sequences, and mature sizes. 
They also should share certain biological enzymatic 
25 properties of their respective proteins. 

The human protease F06B09 translation product 
. exhibits structural motifs of a member of the matrix 
metalloproteinase family of proteases, more specifically 
to a family of matrix degrading proteinases . These 

3 0 proteins, in the latent form, typically possess a 

prodomain form which masks the catalytic site, which 
chelates a zinc ion. See Vallee and Auld (1990) 
Biochemistry 29:5647-5659. The processed mature protein 
is typically a potent cell-matrix degrading enzyme. See, 
3 5 e.g., Birkedal-Hansen (1990) Proc . Nat ' 1 Acad. Sci . USA 

87:5578-5582. The enzyme may remain attached to the cell 
membrane after activation, and many of these proteases 
may localize to the leading cellular processes when the 
cell migrates. This may suggest other protein-protein 
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interactions, e.g., with domains which specifically 
localize- the enzyines by cytoskeletal or other mechanisms. 

The pro-enzyme activation (furin) site would 
correspond to the Arg stretch from 86-90 of SEQ ID NO: 2; 
5 or 212-222 of SEQ ID NO : 4 . It is likely that the 
activating enzyme will be one of the furin/ PACE 
proteases, which are essentially ubiquitously expressed. 
By analogy to the MT1-MMP, these proteases activate other 
proteases, e.g., gelatinase A, collagenases , and others, 
10 that assist in the degradation of the extracellular 
matrix. See, e.g. / Basbaum and Werb (1996) Current 
Opinion in Cell Biology 8:731-738. 

F06B09 contains the zinc binding peptide consensus 
at about residues 212-222 of SEQ ID NO: 4, with 
15 characteristic His residues at 212, 216, and 222. There 
is also a hemopexin-like domain at about residues 291-525 
and a matrixin-like domain corresponding to about 
residues 1-211- Natural substrates for the proteinase 
may be identified using standard methods. Substrate 
20 sequence specificity may be determined, and search for 
such sequences in databases may identify specific 
candidates for physiological substrates. 

One of skill will readily recognize that some 
sequence variations may be tolerated, e.g., conservative 
25 substitutions or positions remote from the critical 
helical structures and remote from the identified or 
consensus critical active site regions, without altering 
significantly the biological activity of each respective 
molecule. 

3 0 F06B09 proteins are present in specific cell types, 

e.g., dendritic cells, and the interaction of the 
protease with a substrate will be important for mediating 
various aspects of cellular physiology or development. 
The cellular types which express messages encoding F06B09 
35 suggest that signals important in cell differentiation 

and development are mediated by them. See, e.g., Gilbert 
(1991) Developmental Biology (3d ed. ) Sinauer Associates, 
Sunderland, MA; Browder, et al . (1991) Developmental 
Biology (3d ed. ) Saunders, Philadelphia, PA.; Russo, et 
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al, (1992) DeveloDinpnt: The Molecular Genetic Approach 
Springer-Verlag, New York, N.Y.; and Wilkins (1993) 
Genetic Analysis of Anima l Development (2d ed. ) Wiley- 
Liss, New York, N.Y. In particular, the proteases may be 
5 necessary for the conversion of pro-proteins to proteins, 
e.g., cytokine or protein precursors to mature forms, or 
for proper immunological function, e.g., antigen 
processing and presentation. Alternatively, the • 
proteases may be important in dendritic cell trafficking, 
10 e.g., to traverse through extracellular matrix or 
vascular surfaces . 

II. Definitions 

The term "binding composition" refers to molecules 
15 that bind with specificity to F06B09, e.g., in an 

antibody- antigen interaction. However, other compounds, 
e.g., complex associated proteins, may also specifically 
associate with F06B09 to the exclusion of other 
molecules. Typically, the association will be in a 
20 natural physiologically relevant protein-protein 

interaction, either covalent or non-covalent , and may 
include members of a multiprotein complex, including 
carrier compounds or dimerization partners. The molecule 
may be a polymer, or chemical reagent. A functional - 
25 analog may be a protease with structural modifications, 
or may be a wholly unrelated molecule, e.g-, which has a 
molecular shape which interacts with the appropriate 
substrate cleavage determinants. 

The term "binding agent:F06B09 protein complex", as 
30 used herein, refers to a complex of a binding agent and 
an F06B09 protein that is formed by specific binding of 
the binding agent to the F06B09 protein. Specific 
binding of the binding agent means that the binding agent 
has a specific binding site that recognizes a site on the 
35 F06B09 protein, typically in the native conformation, but 
possibly in a denatured conformation, e.g., a Western 
blot. For example, antibodies raised to an F06B09 
protein and recognizing an epitope on the F06B09 protein 
are capable of forming a binding agent :F06B09 protein 
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complex by specific binding. Typically, the formation of 
a binding agent :F0 6B09 protein complex allows the 
measurement of F06B09 protein in a biological sample, 
e.g., a mixture with other proteins and biologies. The 
5 term " antibody : F06B09 protein complex" refers to an 
embodiment in which the binding agent is an antibody. 
The antibody may be monoclonal, polyclonal, or a binding 
fragment of an antibody, e.g., an Fab, F(ab)2, or Fv 
. fragment. The antibody will preferably be a polyclonal 

10 antibody for cross-reactivity determinations. 

"Homologous" nucleic acid sequences, when compared, 
exhibit significant similarity or identity. The 
standards for homology in nucleic acids are either 
measures for homology generally used in the art by 

15 sequence comparison and/or phylogenetic relationship, or 
based upon hybridization conditions. Hybridization 
conditions are described in greater detail below. 

An "isolated" nucleic acid is a nucleic acid, e.g., 
an RNA, DNA, or a mixed polymer, which is substantially 

20 separated from other biologic components which naturally 
accompany a native sequence, e.g., proteins and flanking 
genomic sequences from the originating species. The term 
embraces a nucleic acid sequence which has been removed 
from its naturally occurring environment, and includes 

25 recombinant or cloned DNA isolates and chemically 

synthesized analogs, or analogs biologically synthesized 
by heterologous systems. A substantially pure molecule 
includes isolated forms of the molecule. An isolated 
nucleic acid will usually contain homogeneous nucleic 

30 acid molecules, but will, in some embodiments, contain 
nucleic acids with minor sequence heterogeneity. This 
heterogeneity is typically found at the polymer ends or 
portions not critical to a desired biological function or 
activity. 

35 As used herein, the term "F06B09" protein shall 

encompass, when used in a protein context, a protein 
having amino acid sequences, particularly from the 
protein motif portions, shown in SEQ ID NO: 2 or 4, 
respectively. In many contexts, a significant fragment 
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of such a protein will be functionally equivalent. The 
invention also embraces a polypeptide which exhibits 
similar structure to human F06B09 protein, e.g., which 
interacts with F06B09 specific binding components. These 
5 binding components, e.g., antibodies, typically bind to 
F06B09 protein with high affinity, e.g., at least about 
100 nM, usually better than about 3 0 nM, preferably 
better than about 10 nM, and more preferably at better 
than about 3 nM. 
10 The term "polypeptide" or "protein" as used herein 

includes a significant fragment or segment of protease 
motif portion of F06B09 protein, and encompasses a 
stretch of amino acid residues of at least about 8 amino 
acids, generally at least about 10 amino acids, more 
15 generally at least about 12 amino acids, often at least 
about 14 amino acids, more often at least about 16 amino 
acids, typically at least about 18 amino acids, more 
typically .at least about 20 amino acids, usually at least 
about 22 amino acids, more usually at least about 24 
20 amino acids, preferably at least about 2 6 amino acids, 
more preferably at least about 28 amino acids, and, in 
particularly preferred embodiments, at least about 3 0 or 
more amino acids, e.g., 35, 40, 45, 50, 60, 70, 80, 100, 
etc . Preferred ends of such polypeptides will correspond^ 
25 to a motif or boundary described above, e.g., in Table 1. 
Preferably, a polypeptide will contain a plurality of 
distinct, e.g., discrete or nonover lapping, segments of 
the specified length. Typically, the plurality will be 
at least two, more usually at least three, and preferably 
3 0 5, 7, or even more. While the length minima are 

provided, longer lengths, of various sizes, may be 
appropriate, e.g., one of length 7, and two of length 12. 

A "recombinant" nucleic acid is defined either by 
its method of production or. its structure. In reference 
35 to its method of production, e.g., a product made by a 
process, the process is use of recombinant nucleic acid 
techniques, e.g., involving human intervention in the 
nucleotide sequence, typically selection or production. 
Alternatively, it can be a nucleic acid made by 
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generating a sequence comprising fusion of two fragments 
which are not naturally contiguous to each other, but is 
meant to exclude products of nature, e.g., naturally 
occurring mutants. Thus, for example, products made by 
5 transforming cells with any non-naturally occurring 

vector is encompassed, as are nucleic acids comprising 
sequence derived using any synthetic oligonucleotide 
process. Such is often done to replace a codon with a 
redundant codon encoding the same or a conservative amino 
10 acid, while typically introducing or removing a sequence 
recognition site. Alternatively, it is performed to join 
together nucleic acid segments of desired functions to 
generate a single genetic entity comprising a desired 
combination of functions not found in the commonly 
15 available natural ■ forms . Restriction enzyme recognition 
sites are often the target of such artificial 
manipulations, but other site specific targets, e.g., 
promoters, DNA replication sites, regulation sequences, 
control sequences, or other useful features may be 
20 incorporated by design. A similar concept is intended 
for a recombinant, e.g., fusion, polypeptide. 
Specifically included are synthetic nucleic acids which, 
by genetic code redundancy, encode polypeptides similar 
to fragments of these antigens, and fusions of sequences 
25 from various different species variants. 

"Solubility" is reflected by sedimentation measured 
in Svedberg units, which are a measure of the 
.sedimentation velocity of a molecule under particular 
conditions. The determination of the sedimentation 
3 0 velocity was classically performed in an analytical 
ultracentrifuge, but is typically now performed in a 
standard ultracentrifuge. See, Freif elder (19 82) 
Physical Biochemistry (2d ed.) W.H. Freeman & Co., San 
Francisco, CA; and Cantor and Schimmel (1980) Biophysical 
35 Chemistry parts 1-3, W.H. Freeman & Co., San Francisco, 
CA. As a crude determination., a sample containing a 
putatively soluble polypeptide is spun in a standard full 
sized ultracentrifuge at about 50K rpm for about 10 
minutes, and soluble molecules will remain in the 
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supernatant. A soluble particle or polypeptide will 
typically be less than about 3 OS, more typically less 
than about 15 S, usually less than about 10 S, more usually 
less than about 6S, and, in particular embodiments, 
5 preferably less than about 4S, and more preferably less 
than about 3S. Solubility of a polypeptide or fragment 
depends upon the environment and the polypeptide. Many 
parameters affect polypeptide solubility, including 
temperature, electrolyte environment, size and molecular 
10 characteristics of the polypeptide, and nature of the 
solvent. Typically, the temperature at which the 
polypeptide is used ranges from about 4° C to about 65° 
C. Usually the temperature at use is greater than about 
18° C and more usually greater than about 22° C. For 
15 diagnostic purposes, the temperature will usually be 
about room temperature or warmer, but less than the 
denaturation temperature of components in the assay. For 
therapeutic purposes, the temperature will usually be 
body temperature, typically about 3 7° C for humans, 
20 though under certain situations the temperature may be 
raised or lowered in situ or in vitro. 

The size and structure of the polypeptide should 
generally be in a substantially stable state, and usually 
not in a denatured state. The polypeptide may be 
25 associated with other polypeptides in a quaternary 

structure, e.g., to confer solubility, or associated with 
lipids or detergents in a manner which approximates 
natural lipid bilayer interactions. 

The solvent will usually be a biologically 
3 0 compatible buffer,, of a type used for preservation of 
biological activities, and will usually approximate a 
physiological solvent. Usually the solvent will have a 
neutral pH, typically between about 5 and 10, and 
preferably about 7.5. On some occasions , a detergent 
3 5 will be added, typically a mild non- denaturing one, e.g., 
CHS (cholesteryl hemisuccinate) or CHAPS (3-[3- 
cholamidopropyl) - dime thy lammonio] -1-propane sulfonate), 
or a low enough concentration as to avoid significant 
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disruption of structural or physiological properties of 
the protein. 

"Substantially pure" in a protein context typically 
means that the protein is isolated from other 
5 contaminating proteins, nucleic acids, and other 

biologicals derived from the original source organism. 
Purity, or "isolation" may be assayed by standard 
methods, and will ordinarily be at least about 50% pure, 
more ordinarily at least about 60% pure, generally at 

10 least about 70% pure, more generally at least about 80% 
pure, often at least about 85% pure, more often at least 
about 90% pure, preferably at least about 95% pure, more 
preferably at least about 98% pure, and in most preferred' 
embodiments, at least 99% pure. Similar concepts apply, 

15 e.g., to antibodies or nucleic acids. 

"Substantial similarity" in the nucleic acid 
sequence comparison context means either that the 
segments, or their complementary strands, when compared, 
are identical when optimally aligned, with appropriate 

20 nucleotide insertions or deletions, in at least about 50% 
of the nucleotides, generally at least about 56%, more 
generally at least about 59%, ordinarily at least about 
62%, more ordinarily at least about 65%, often at least 
about 68%, more often at least about 71%, typically at 

25 least about 74%, more typically at least about 77%, 

usually at least about 80%, more 'usually at least about 
85%, preferably at least about 90%, more preferably at 
least about 95 to 98% or more, and in particular 
embodiments, as high at about 99% or more of the 

30 nucleotides. Alternatively, substantial similarity 

exists when the segments will hybridize under selective 
hybridization conditions, to a strand, or its complement, 
typically using a sequence derived from SEQ ID NO: 1 or 
3. Typically, selective hybridization will occur when 

3 5 there is at least about 55% similarity over a stretch of 
at least about 3 0 nucleotides, preferably at least about 
65% over a stretch of at 1 east about 25 nucleotides, more 
preferably at least about 75%, and most preferably at 
least about 90% over about 20 nucleotides. See Kanehisa 
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(1984) Nuc] a^rls Res. 12:203-213. The length of 
similarity comparison, as described, may be over longer 
stretches, and in certain embodiments will be over a 
stretch of at least about 17 nucleotides, usually at 
5 least about 20 nucleotides, more usually at least about 
24 nucleotides, typically at least about 28 nucleotides, 
more typically at least about 40 nucleotides, preferably 
at least about 50 nucleotides, and more preferably at 
least about 75 to 100 or more nucleotides, e.g., 150, 
10 2 00, etc. Various combinations of plurality of such 
segments will also be made. 

For sequence comparison, typically one sequence acts 
:as a reference sequence, to which test sequences are 
compared. When using a sequence comparison algorithm, 
15 test and reference sequences are input into a computer, 

subsequence coordinates are designated, if necessary, and 
sequence algorithm program parameters are designated. 
The sequence comparison algorithm then calculates the 
percent sequence identity for the test sequence(s) 
20 relative to the reference sequence, based on the 
designated program parameters. 

Optical alignment of sequences for comparison can be 
conducted, e.g., by the local homology algorithm of Smith 
and Waterman (1981) kdv ftppl ■ Math. 2:482, by the 
25 homology alignment algorithm of Needlman and Wunsch 

(1970) .t Mr>1 . Biol. 48:443, by the search for similarity 
method of Pearson and Lipman (1988) Pror Naf 1 Acad. 
firi USA 85:2444, by computerized implementations of 
these algorithms (GAP , BESTFIT, FASTA, and TFASTA in the 
30 Wisconsin Genetics Software Package, Genetics Computer 
Group, 575 Science Dr., Madison, WI) . or by visual 
inspection (see generally Ausubel et al . , supra). 

One example of a useful algorithm is PILEUP. PILEUP 
creates a multiple sequence alignment from a group of 
3 5 related sequences using progressive, pairwise alignments 
to show relationship and percent sequence identity. It 
also plots a tree or dendogram showing the clustering 
relationships used to create the alignment. PILEUP uses 
a simplification of the progressive alignment method of 
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Feng and Doolittle (1.987) J . Mol . Evol . 35:351-360. The 
method used is similar to the method described by Higgins 
and Sharp (1989) CABIOS 5:151-153. The program can align 
up to 3 00 sequences, each of a maximum length of 5,000 
5 nucleotides or amino acids. The multiple alignment 

procedure begins with the pairwise alignment of the two 
most similar sequences, producing a cluster of two 
aligned sequences. This cluster is then aligned to the 
next most related sequence or cluster of aligned 

10 sequences. Two clusters of sequences are aligned by'a 
simple extension of the pairwise alignment of two 
individual sequences . The final alignment is achieved by 
a series of progressive, pairwise alignments. The 
program is run by designating specific sequences and 

15 their amino acid or nucleotide coordinates for regions of 
sequence comparison and by designating the program 
parameters. For example, a reference sequence can be 
compared to other test sequences to determine the percent 
sequence identity relationship using the following 

20 parameters: default gap weight (3.00), default gap length 
weight (0.10), and weighted end gaps. 

Another . example of algorithm that is suitable for 
determining percent sequence identity and sequence 
similarity is the BLAST algorithm, which is described 

25 Altschul, et al . (1990) J. Mol. Biol. 215:403-410. 
Software for performing BLAST analyses is publicly 
available through the National Center for Biotechnology 
Information (http:www.ncbi.nlm.nih.gov/) . This algorithm 
involves first identifying high scoring sequence pairs 

30 (HSPs) by identifying short words of length W in the 
query sequence, which either match or satisfy some 
positive-valued threshold score T when aligned with a 
word of the same length in a database sequence. T is 
referred to as the neighborhood word score threshold 

35 (Altschul, et al . , supra). These initial neighborhood 
word hits act as seeds for initiating searches to find 
longer HSPs containing them. The word hits are then 
extended in both directions along each sequence for as 
far as the cumulative alignment score can be increased. 
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Extension of the word hits in each direction are halted 
when: the cumulative alignment score falls off by the 
quantity X from its maximum achieved value; the 
cumulative score goes to zero or below, due to the 
5 accumulation of one or more negative- scoring residue 
alignments; or the end of either sequence is reached. 
The BLAST algorithm parameters W, T, and X determine the 
sensitivity and speed of the alignment. The BLAST 
program uses as defaults a wordlength (W) of 11, the 
10 BLOSUM62 scoring matrix (see Henikoff and Henikoff (1989) 
Pr oc . Nat 1 1 Acad . Sci . USA 89:10915) alignments (B) of 
50, expectation (E) of 10, M=5, N=4 , and a comparison of 
both strands . 

In addition to calculating percent sequence 
15 identity, the BLAST algorithm also performs a statistical 
analysis of the similarity between two sequences (see, 
e.g., Karlin and Altschul (1993) Proc. Nat ' 1 Acad. Sci. 
USA 90:5873-5787) . One measure of similarity provided by 
the BLAST algorithm is the smallest sum probability 
20 (P(N)), which provides an indication of the probability 
by which a match between two nucleotide or amino acid 
sequences would occur by chance. For example, a nucleic 
acid is considered similar to a reference sequence if the 
smallest sum probability in a comparison of the test 
25 nucleic acid to the reference nucleic acid is less than 

about 0.1, more preferably less than about 0.01, and most 
preferably less than about 0.001. 

A further indication that two nucleic acid sequences 
of polypeptides are substantially identical is that the 
3 0 polypeptide encoded by the first nucleic acid is 

immunologically cross reactive with the polypeptide 
encoded by the second nucleic acid, as described below. 
Thus, a polypeptide is typically substantially identical 
to a second polypeptide, for example, where the two 
35 peptides differ only by conservative substitutions. 

Another indication that two nucleic acid sequences are 
substantially identical is that the two molecules 
hybridize to each other under stringent conditions, as 
described below. 
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"Stringent conditions", in referring to homology or 
substantial similarity in the hybridization context, will 
be stringent combined conditions of salt, temperature, 
organic solvents, and other parameters, typically those 
5 controlled in hybridization reactions. The combination 
of parameters is generally more important than the 
measure of any single parameter. See, e.g., Wetmur and 
Davidson (1968) J. Mol . Biol. 31:349-370. A nucleic acid 
probe which binds to a target nucleic acid under 

10 stringent conditions is specific for said target nucleic 
acid. Such a probe is typically more than 11 nucleotides 
in length, and is sufficiently identical or complementary 
to a target nucleic acid over the region specified by the 
sequence of the probe to bind the target under stringent 

15 hybridization conditions. Hybridization under stringent 
conditions should give a background of at least 2-fold 
over background, preferably at least 3-5 or more. 

F0 6B09 proteins from other mammalian species can be 
cloned and" isolated by cross-species hybridization of 

20 closely related species. See, e.g., below. Similarity 
ma y be relatively low between distantly related species, 
and thus hybridization of relatively closely related 
species is advisable. Alternatively, preparation of an 
antibody preparation which exhibits less species 

25 specificity may be useful in expression cloning 
approaches . 

The phrase "specifically binds to an antibody" or 
"specifically immunoreactive with", when referring to a 
protein or peptide, refers to a binding reaction which is 

3 0 determinative of the presence of the protein in the 

presence of a heterogeneous population of proteins and . 
other biological components. Thus, under designated 
immunoassay conditions, the specified .antibodies bind to 
a particular protein and do not significantly bind other 

3 5 proteins present in the sample. Specific binding to an 
antibody under such conditions may require an antibody 
that is selected for its specificity for a particular 
protein. For example, antibodies raised to the human 
F0 6B0 9 protein immunogen with the amino acid sequence 
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depicted in SEQ ID NO: 2 or 4 can be selected by 
immunoaf f inity or similar methods to obtain antibodies 
specifically immunoreactive with F06B09 proteins and not 
with other proteins . 

5 

III. Nucleic Acids 

F06B09 proteins are exemplary of larger classes of 
structurally and functionally related proteins. The 
F06B09 proteins will typically serve to cleave or process 
10 various proteins produced or processed by various cell 
types, e.g., for antigen presentation . The preferred 
embodiments, as disclosed, will be useful in standard 
procedures to isolate genes from different individuals or 
other species, e.g., warm blooded animals, such as birds 
15 and mammals. Cross hybridization will allow isolation of 
related genes encoding proteins from individuals, 
strains, or species. A number of different approaches 
are available to successfully isolate a suitable nucleic 
acid clone based upon the information provided herein. 
20 Southern blot hybridization studies can qualitatively 
determine the presence of homologous genes in human, 
monkey, rat, dog, cow, and rabbit genomes under specific 
hybridization conditions. 

Complementary sequences will also be used as probes 
25 or primers. Based upon identification of the likely 
amino terminus, other peptides should be particularly 
useful, e.g., coupled with anchored vector or poly-A 
complementary PCR techniques or with complementary DNA of 
other peptides. 
3 0 Techniques for nucleic acid manipulation of genes 

encoding F06B09 proteins, such as subcloning nucleic acid 
sequences encoding polypeptides into expression vectors, 
labeling probes, DNA hybridization, and the like are 
described generally in Sambrook, et al . (1989) Molecular 
35 Cloning: A Laboratory Manual (2nd ed. ) Vol. 1-3, Cold 

Spring Harbor Laboratory, Cold Spring Harbor Press, NY, 
which is incorporated herein by reference. This manual 
is hereinafter referred to as "Sambrook, et al . " 
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There are various methods of isolating DNA sequences 
encoding F06B09 proteins. For example, DNA is isolated 
from a genomic or cDNA library using labeled 
oligonucleotide probes having sequences identical or 
5 complementary to the sequences disclosed herein. Full- 
length probes may be used, or oligonucleotide probes may 
be generated by comparison of the sequences disclosed. 
Such probes can be used directly- in hybridization assays 
to isolate DNA encoding F06B09 proteins, or probes can be 
10 designed for use in amplification techniques such as PCR, 
for the isolation of DNA encoding F06B09 proteins. 

To prepare a cDNA library, mRNA is isolated from 
cells, preferably which express high levels of an F0 6B09 
protein. cDNA is prepared from the mRNA and ligated, 
15 e.g., into a recombinant vector. The vector is 

trans feet ed into a recombinant host for propagation, 
screening, and cloning. Methods for making and screening 
cDNA libraries are well known. See Gubler and Hoffman 
(1983) Gene 25:263-269 and Sambrook, et al . 
2 0 For a genomic library, the DNA can be extracted from, 

tissue, and often either mechanically sheared or 
enzymatically digested to yield fragments of about 12-20 
kb. The fragments are then separated by gradient 
centrif ugation and cloned in bacteriophage lambda 
25 vectors. These vectors and phage are packaged in vitro, 
as described in Sambrook, et al. ' Recombinant phage are 
analyzed by plaque hybridization as described in Benton 
and Davis (1977) Science 196:180-182. Colony 
hybridization is carried out as generally described in, 
30 e.g., Grunstein, et al. (1975) Proc. Natl. Acad. Sci. 
USA. 72:3961-3965. 

DNA encoding an F06B09 protein can be identified in 
either cDNA or genomic libraries by its ability to 
hybridize with the nucleic acid probes described herein, 
35 e.g., in colony or plaque hybridization assays. The 

corresponding DNA regions are isolated, e.g., by standard 
methods familiar to those of skill in the art. See, 
e.g., Sambrook, et al . 
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Various methods of amplifying target sequences, such 
as the polymerase chain reaction, can also be used to 
prepare DNA encoding F06B09 proteins. Polymerase chain 
reaction (PCR) technology may be used to amplify such 
5 nucleic acid sequences directly from mRNA, from cDNA, 
and/or from genomic libraries or cDNA libraries. . The 
isolated sequences encoding F06B09 proteins may also be 
used as templates for PCR amplification. 

Typically, in PCR techniques, oligonucleotide 
10 primers complementary to two flanking regions in the DNA 
region to be amplified are synthesized. The polymerase 
chain reaction is then carried out using the two primers. 
See Innis, et al . (eds. 1990) PCR Protocols: A Guide to 
Methods "and Applications Academic Press, San Diego, CA. 
15 Primers can be selected to amplify the entire regions 

encoding a full-length human F06B09 protein or to amplify 
smaller DNA segments, as desired. Once such regions are 
PCR-amplif ied, they can be sequenced and oligonucleotide 
probes can be prepared from sequence obtained using 
20 standard techniques. These probes can then be used to 
isolate DNA's encoding F06B09 proteins. 

Oligonucleotides for use. as probes are usually 
chemically synthesized according to the solid phase 
phosphoramidite triester method first described by 
25 Beaucage and Carruthers (1983) Tetrahedron Lett. 

22 (20) : 1859-1862 , or using an automated synthesizer, as 
described in Needham-VanDevanter , et al . (19 84) Nucleic 
Acids Res. 12:6159-6168. Purification of 

oligonucleotides is performed e.g., by native acrylamide . 

3 0 gel electrophoresis or by ani on-exchange HPLC as 
described in Pearson and Regnier (1983) J . Chrom , 
255:137-149. The sequence of the synthetic 
oligonucleotide can be verified using, e.g., the chemical 
degradation method of Maxam, A.M. and Gilbert, W. in 

35 Grossman and Moldave (eds. 1980) Methods in EnzymoloqY 
65:499-560, Academic Press, New York. 

An isolated- nucleic acid encoding a human F06B09 
protein was identified. The nucleotide sequence, 
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corresponding open reading frames, and mature peptides 
are provided in Table 1 or SEQ ID NO: 1 or 3 . 

This invention provides isolated DNA or fragments to 
encode an F06B09 protein or specific fragment thereof. 
5 In addition, this invention provides isolated or 

recombinant DNA which encodes a protein or polypeptide, 
and which is capable of hybridizing under appropriate 
conditions, e.g., high stringency, with the DNA sequences 
described herein. Said biologically active protein or 

10 polypeptide can be a functional protease segment, or 

fragment, and have an amino acid sequence as disclosed in 
SEQ ID NO: 2 or 4. Preferred embodiments will be full 
length natural sequences, from isolates, or proteolytic 
fragments thereof. Further, this invention contemplates 

15 the use of isolated or recombinant DNA, or fragments 
.thereof, which encode proteins which exhibit high 
measures of identity to an F0 6B0 9 protein, or which were 
isolated, e.g., using cDNA encoding an F06B09 protease 
polypeptide as a probe. The isolated DNA can have the 

20 respective regulatory sequences in the 5' and 3* flanks, 
e.g., promoters , enhancers , poly-A addition signals , and 
others . 

IV. Making human F06B09 proteins 
25 DNAs which encode an F06B09 protein, or fragments 

thereof, can be obtained by chemical synthesis, screening 
cDNA libraries, or by screening genomic libraries 
prepared from a wide variety of cell lines or tissue 
samples . 

3 0 These DNAs can be expressed in a wide variety of 

host cells for the synthesis of a full-length protein or 
fragments which can in turn, e.g., be used to generate 
polyclonal or monoclonal antibodies; for binding studies; 
for construction and expression of modified molecules; 

35 and for structure/ function studies.. Each of F06B09, or 
their fragments, can be expressed in host cells that are 
transformed or transfected with appropriate expression 
vectors. These molecules can be substantially purified 
to be free of protein or cellular contaminants, other 
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than those derived from the recombinant host, and 
therefore are particularly useful in pharmaceutical 
compositions when combined with a pharmaceutically 
acceptable carrier and/or diluent. The antigen, e.g., 
5 F06B09, or portions thereof, may be expressed as fusions 
with other proteins or possessing an epitope tag.. 

Expression vectors are typically self -replicating 
DNA or RNA constructs containing the desired antigen gene 
or its fragments, usually operably linked to appropriate 
10 genetic control elements that are recognized in a 
suitable host cell. The specific type of control 
elements necessary to effect expression will depend upon 
the eventual host cell used. ' Generally, the genetic 
control elements can include a prokaryotic promoter 
15 system or a eukaryotic promoter expression control 

system, and typically include a transcriptional promoter, 
an optional operator to control the onset of 
transcription, transcription enhancers to elevate the 
level of mRNA expression, a sequence that encodes a 
20 suitable ribosome binding site, and sequences that 

terminate transcription and translation. Expression 
vectors also usually contain an origin of replication 
that allows the vector to replicate independently from 
the host cell. 

25 The vectors of this . invention contain DNAs which, 

encode an F06B09 protein, or a significant fragment 
thereof, typically encoding, e.g., a biologically active 
polypeptide, or protein. The DNA can be under the 
control of a viral promoter-and can encode a selection 
3 0 marker. This invention further contemplates use of such 
expression vectors which are capable of expressing 
eukaryotic cDNA coding for an F06B09 protein in a 
prokaryotic or eukaryotic host, where the vector is 
compatible with the host and where the eukaryotic cDNA 
35 coding for the protein is inserted into the vector such 
that growth of the host containing the vector expresses 
the cDNA in question. Usually, expression vectors are 
designed for stable replication in their host cells or 
for amplification to greatly increase the total number of 
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copies of the desirable gene per cell. It is not always 
necessary to require that an expression vector replicate 
in a host cell, e.g., it is possible to effect transient 
expression of the protein or its fragments in various 
5 hosts using vectors that do not contain a replication 
origin that is recognized by the host cell. It is also 
possible to use vectors that cause integration of an 
F06B09 protein gene or its fragments into the host DNA by 
recombination, or to integrate a promoter which controls 

10 expression of an endogenous gene. 

Vectors, as used herein, contemplate plasmids, 
viruses, bacteriophage, integratable DNA fragments, and 
other vehicles which enable the integration of DNA 
fragments into the genome of the host. Expression. 

15 vectors are specialized vectors which contain genetic 
control elements that effect expression of operably 
linked genes. Plasmids are the most commonly used form 
of vector, but many other forms of vectors which serve an 
equivalent function are suitable for use herein. See, 

20 e.g., Pouwels, et al . (1985 and Supplements) Cloning 
Vectors: A Laboratory Manual Elsevier, N.Y.; and 
Rodriguez, et al . (eds. 1988) Vectors : A Survey of 
Molecular Cloning Vectors and Their Uses Buttersworth, 
Boston, MA. 

25 Suitable host cells include prokaryotes , lower 

eukaryotes, and higher eukaryotes. Prokaryotes include 
both gram negative and gram positive organisms, e.g., E. 
coli and B. subtilis . Lower eukaryotes include yeasts, 
e.g., S. cerevisiae and Pichia, and species of the genus 

3 0 Dictyostelium. Higher eukaryotes include established 
tissue culture cell lines from animal cells, both of 
non-mammalian origin, e.g., insect cells, and birds, and 
of mammalian origin, e.g., human, primates, and rodents. 
Prokaryotic host-vector systems include a wide 

35 variety of vectors for many different species. As used 
herein, E. coli and its vectors will be used generically 
to include equivalent vectors used in other prokaryotes. 
A representative vector for amplifying DNA is pBR322 or 
its derivatives. Vectors that can be used to express 
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F06B09 proteins or fragments thereof include, but are not 
limited to, such vectors as those containing the lac 
promoter (pUC-series) ; trp promoter (pBR322-trp) ; .Ipp 
promoter (the pIN-series) ; lambda-pP or pR promoters 
5 (pOTS) ; or hybrid promoters such as ptac (pDR540) . See 
Brosius, et al . (1988) "Expression Vectors Employing 
Lambda-, trp-, lac-, and Ipp-derived Promoters", in 
Rodriguez and Denhardt ( eds . ) Vectors: A Survey of 
Molecular Cloning Vector s and Their Uses 10:205-236 
10 Buttersworth, Boston, MA. 

Lower eukaryotes , e.g., yeasts and Dictyostelium, 
may be transformed with F06B09 protein sequence 
containing vectors. For purposes of this invention, the 
most common lower eukaryotic host is the baker's yeast, 
15 Saccharomyces cerevisiae. It will be used genericaily to 
represent lower eukaryotes although a number of other 
strains and species will be essentially equivalent. 
Yeast vectors typically consist of a replication origin 
(unless of the integrating type) , one or more selection 
20 genes, a promoter, DNA encoding the desired protein or 

its fragments, and sequences for translation termination, 
polyadenylation, and transcription termination. Suitable 
expression vectors for yeast include such constitutive 
promoters as 3 -phosphoglycerate kinase and various other 
25 glycolytic enzyme gene promoters or such inducible 

promoters as the alcohol dehydrogenase 2 promoter or 
metallothionine promoter. Suitable vectors include 
derivatives of the following types: self -replicating low 
copy number (such as the YRp-series) , self -replicating 
30 high copy number (such as the YEp-series) , integrating 

types (such as the Yip-series) , or mini -chromosomes (such 
as. the YCp- series) . 

Higher eukaryotic tissue culture cells are typically 
the preferred host cells for expression of the 
35 functionally active F06B09 protease polypeptides. In 
principle, many higher eukaryotic tissue culture cell 
lines may be used, e.g., insect baculovirus expression 
systems, whether from an invertebrate or vertebrate 
source. However, mammalian cells are preferred to 
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achieve proper natural processing, both cotranslationally 
and posttranslationally. Transformation or transfection 
and propagation o-f such cells are routine. Useful cell 
lines include HeLa cells, Chinese hamster ovary (CHO) 
5 cell lines, baby rat kidney (BRK) cell lines, insect cell 
lines, bird cell lines, and monkey (COS) cell lines. 
Expression vectors for such cell lines usually include an 
origin of replication, a promoter, a translation 
initiation site, RNA splice sites (e.g., if genomic DNA 
10 is used) , a polyadenylation site, and a transcription 
termination site. These vectors also may contain 
selection and/or amplification genes. Suitable 
expression vectors may be plasmids, viruses, or 
retroviruses carrying promoters derived, e.g., from such 
15 sources as from adenovirus, SV40, parvoviruses, vaccinia 
virus, or cytomegalovirus. Representative examples of 
suitable expression vectors include pCDNAl; pCD, see 
Okayama, et al . (1985) Mnl r^ll Biol. 5:1136-1142; 
pMClneo Poly-A, see Thomas, et al . (19 87) Cell 51:503- 
20 512; and a baculovirus vector such as pAC 373 or pAC 610. 
It is likely that F06B09 protein need not be 
glycosylated to elicit biological responses. However, it 
will occasionally be desirable to express an F06B09 
polypeptide in a system which provides a specific or 
25 defined glycosylation pattern. In this case, the usual 

pattern will be that provided naturally by the expression 
system. However, the pattern will be modifiable by 
exposing the polypeptide, e.g., in unglycosylated form, 
to appropriate glycosylating proteins introduced into a 
30 heterologous expression system. For example, an F06B09 

protein gene may be co-transformed with one or more genes 
encoding mammalian or other glycosylating enzymes. It is 
further understood that over glycosylation may be 
detrimental to F06B09 protein biological activity, and 
35 that one of skill may perform routine testing to optimize 
the degree of glycosylation which confers optimal 
biological activity. 

An F06B09 protein, or a fragment thereof, may be 
engineered to be phosphatidyl inositol (PI) linked to a 
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cell membrane, but can be removed from membranes by 
treatment with a phosphatidyl inositol cleaving enzyme, 
e.g., phosphatidyl inositol phospholipase-C . This 
releases the antigen in a biologically active form, and 
5 allows purification by standard procedures of protein 

chemistry. See, e.g., Low (1989) Biochem. Bioohvs . Acta 
988:427-454; Tse, et al . (1985) Science 230:1003-1008; 
and Brunner, et al . (1991) ,T. Cell Biol. 114:1275-1283. 
Now that F06B09 proteins have been characterized, 
10 fragments or derivatives thereof can be prepared by 

conventional processes for synthesizing peptides. These 
include processes such as are described in Stewart and 
Young (1984) Soli ri Pha .qg Peptide Synthesis Pierce 
Chemical Co., Rockford, IL; Bodanszky and Bodanszky 
15 (1984) The Practice of Peptide Synthesis Springer-Verlag , 
New York, NY; Bodanszky (19 84) The Principles of Peptide 
Synthesis Springer-Verlag, New York, NY; and Dawson, et 
al. (1994) - Science 266:776-779. For example, an azide 
process, an acid chloride process, an acid anhydride 
20 process, a mixed anhydride process, an active ester 
process (for example, p-nitrophenyl ester, N- 
hydroxysuccinimide ester, or cyanomethyl ester), a 
carbodiimidazole process, an oxidative-reductive process, 
or a dicyclohexylcarbodiimide (DCCD) /additive process can 
25 be used. Solid phase and solution phase syntheses are 
both applicable to the foregoing processes . 

The prepared protein and fragments thereof can be 
isolated and purified from the reaction mixture by means 
of peptide separation, for example, by extraction, 
30 precipitation, electrophoresis and various forms of 

chromatography, and the like. The F06B09 proteins of 
this invention can be obtained in varying degrees of 
purity depending upon its desired use. Purification can 
be accomplished by use of known protein purification 
35 techniques or by the use of the antibodies or binding 
partners herein described, e.g., in immunoabsorbant 
affinity chromatography. This immunoabsorbant affinity 
chromatography is carried out, e.g., by first linking the 
antibodies to a solid support and then contacting the 
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linked antibodies with solubilized lysates of appropriate 
source cells, lysates of other cells expressing the 
ligand, or lysates or supernatants of cells producing the 
F0 6B09 proteins as a result of recombinant DNA 
5 techniques, see below. 

Multiple cell lines may be screened for one .which 
expresses an F0 6B09 polypeptide or protein at a high 
level compared' with other cells. Various cell lines, 
e.g., a mouse thymic stromal cell line TA4 , is screened 

10 and selected for its favorable handling properties. 

Natural F06B09 proteins can be isolated from natural 
• sources, or by expression from a transformed cell using 
an appropriate expression vector. Purification of the 
expressed protein is achieved by standard procedures, or 

15 may be combined with engineered means for effective 
purification at high efficiency from cell lysates or 
supernatants. Epitope or other tags, e.g., FLAG or Hisg 

segments, can be used for such purification features. 

20 V. Antibodies 

Antibodies can be raised to various F06B09 proteins, 
including individual, polymorphic, allelic, strain, or 
species variants, and fragments thereof, both in their 
naturally occurring (full-length) forms and in their 

25 recombinant forms. Additionally, antibodies can be 
raised to F0 6B09 proteins in either their native (or 
active) forms or in their inactive, e.g., denatured, 
forms. Anti-idiotypic antibodies may also be used. 
A. Antibody Production 

3 0 A number of immunogens may be used to produce 

antibodies specifically reactive with F06B09 proteins. 
Recombinant protein is a preferred immunogen for the 
production of monoclonal or polyclonal antibodies. 
Naturally occurring protein may also be used either in 

35 pure or impure form. Synthetic peptides, made using the 
human F06B09 protein sequences described herein, may also 
used as an immunogen for the production of antibodies to 
F06B09 proteins. Recombinant protein can be expressed in 
eukaryotic or prokaryotic cells as described herein, and 
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purified as described. Naturally folded or denatured 
material can be used, as appropriate, for producing 
antibodies. Either monoclonal or polyclonal antibodies 
may be generated, e.g., for subsequent use in 
5 immunoassays to measure the protein. 

Methods of producing polyclonal antibodies are well 
known to those of skill in the art. Typically, an 
immunogen, preferably a purified protein, is mixed with 
an adjuvant and animals are immunized with the mixture. 
10 The animal's immune response to the immunogen preparation 
is monitored by taking test bleeds and determining the 
titer of reactivity to the F0 6B09 protein of interest. 
For example, when appropriately high titers of antibody 
to the immunogen are obtained, usually after repeated 
15 immunizations, blood is collected from the animal and 
antisera are prepared. Further fractionation of the 
antisera to enrich for antibodies reactive to the protein 
can be done if desired. See, e.g., Harlow and Lane; or 
Coligan. 

20 Monoclonal antibodies may be obtained by various 

techniques familiar to those skilled in the art. 
Typically, spleen cells from an animal immunized with a 
desired antigen are immortalized, commonly by fusion with 
a myeloma cell (see, Kohler and Milstein (197 6) Eur. J . 
25 Immunol . 6:511-519, incorporated herein by reference). 
Alternative methods of immortalization include 
transformation with Epstein Barr Virus, oncogenes, or 
retroviruses, or other methods known in the art. 
Colonies arising from single immortalized cells are 
30 screened for production of antibodies of the desired 

specificity and affinity for the antigen, and yield of 
the monoclonal antibodies produced by such cells may be 
enhanced by various techniques," including injection into 
the peritoneal cavity of a vertebrate host. 
3 5 Alternatively, one may isolate DNA sequences which encode 
a monoclonal antibody or a binding fragment thereof by 
screening a DNA library from human B cells according, 
e.g., to the general protocol outlined by Huse, et al . 
(1989) Science 246:1275-1281. 
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Antibodies, including binding fragments and single 
chain versions, against predetermined fragments of F06B09 
proteins can be raised by immunization of animals with 
conjugates of the fragments with carrier proteins as 
5 described above. Monoclonal antibodies are prepared from 
cells secreting the desired antibody. These antibodies 
can be screened for binding to normal or defective F06B09 
protein, or screened for agonistic or antagonistic 
activity, e.g., mediated through a receptor. These 
10 monoclonal antibodies will usually bind with at least a 
K D of about 1 mM, more usually at least about 300 |iM, 

typically at least about 100 |iM, more typically at least 
about 3 0 [iM, preferably at least about 10 \m, and more 
preferably at least about 3 [iM or better. 
15 in some instances, it is desirable to prepare 

monoclonal antibodies from various mammalian hosts, such 
as mice, rodents, primates, humans, etc. Description of 
techniques .for preparing such monoclonal antibodies may 
be found in, e.g., Stites, et al . (eds.) Basic and 
20 Clinica!) immunology (4th ed. ) Lange Medical Publications, 
Los Altos, CA, and references cited therein; Harlow and 
Lane (1988) Antibodies: . A L aboratory Manual CSH Press ; 

. Coding (1986) Monoclonal Antibodies: Principles and 

Practice (2d ed. ) Academic Press, New York, NY; and 
25 particularly in Kohler and Milstein (1975) Nature 

256:495-497, which discusses one method of generating 
monoclonal antibodies. Summarized briefly, this method 
involves injecting an animal with an immunogen. The 
animal is then sacrificed and cells taken from its 
3 0 spleen, which are then fused with myeloma cells. The 

result is a hybrid cell or "hybridoma" that is capable of 
reproducing in vitro. The population of hybridomas is 
then screened to isolate individual clones, each of which 
secrete a single antibody species to the immunogen. In 
35 this manner, the individual antibody species obtained are 
the products of immortalized and cloned single B cells 
from the immune animal generated in response to a 
specific site recognized on the immunogenic substance. 
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Other suitable techniques involve selection of 
libraries of antibodies in phage or similar vectors. 
See, e.g., Huse, et al . (1989) "Generation of a Large 
Combinatorial Library of the Immunoglobulin Repertoire in 
5 Phage Lambda," Science 246:1275-1281; and Ward, et al . 
(1989) Nature 341:544-546. The polypeptides and 
antibodies of the present invention may be used with or ■ 
without modification, including chimeric or humanized 
antibodies. Frequently, the polypeptides and antibodies 
10 will be labeled by joining, either covalently or non- 

covalently, a substance which provides for a detectable 
signal. A wide variety of labels and conjugation 
techniques are known and are reported extensively in both 
the scientific and patent literature. Suitable labels 
15 include radionuclides, enzymes, substrates, cof actors, 
inhibitors, fluorescent moieties, chemiluminescent 
moieties, magnetic particles, and the like. Patents 
teaching the use of such labels include U.S. Patent Nos . 
3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 
20 4,275,149; and 4,366,241. Also, recombinant 

immunoglobulins may be produced, see, Cabilly, U.S. 
Patent No. 4,816,567; and Queen, et al . (1989) Proc. 
Naf.'l AcaH. Sci ■ USA 86:10029-10033 ; or made in 
transgenic mice, see Mendez , et al . (1997) Nature 
25 Genetics 15:146-156. 

" The antibodies of this invention are useful for 
affinity chromatography in isolating F06B09 protein. 
Columns can be prepared where the antibodies are linked 
to a solid support, e.g., particles, such as agarose, 
3 0 SEPHADEX, or the like, where a cell lysate or supernatant 
may be passed through the column, the column washed, 
followed by increasing concentrations of a mild 
denaturant, whereby purified F06B09 protein will be 
released. The converse can be performed using protein to 
35 isolate specific antibodies. 

Other antibodies may block enzymatic activity. The 
• antibodies may also be used to screen expression 

libraries for particular expression products. Usually 
the antibodies used in such a procedure will be labeled 
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with a moiety allowing easy detection of presence of 
antigen by antibody binding. 

Antibodies to F0 6B09 proteins may be used for the 
identification of cell populations expressing F06B09 
5 protein. By assaying the expression products of cells 
expressing F0 6B0 9 proteins it is possible to diagnose 
disease, e.g., metabolic conditions. The proteins may 
also be markers for specific tissue or cell 
subpopulations, e.g., dendritic cells. 
10 Antibodies raised against each F06B09 protein will 

also be useful to raise anti-idiotypic antibodies. These 
will be useful in detecting or diagnosing various 
immunological conditions related to expression of the 
respective antigens. 

15 

B . Immunoassays 

A particular protein can be measured by a variety of 
immunoassay methods. For a review of immunological and 
immunoassay procedures in general, see Stites and Terr 
20 (eds. 1991) Basic and Clinical Im munology (7th ed. ) . 

Moreover, the immunoassays of the present invention can 
be performed in many configurations, which are reviewed 
extensively in Maggie (ed. 1980) Enzyme Immunoassay CRC 
Press, Boca Raton, Florida; Tijan (19 85) "Practice and 
25 Theory of Enzyme Immunoassays," -Laboratory Techniques in 
Biochemi^-rv and Molecula r Biology. Elsevier Science 
Publishers B.V., Amsterdam; and Harlow and Lane 
Antibodies. A Laborator y Manual, supra, each of which is 
incorporated herein by reference. See also Chan (ed. 
3 0 1987) Immunoassay: A Practic al Guide Academic Press, 

Orlando, FL; Price and Newman (eds. 1991) Principles and 
Practice of Immunoassays Stockton Press, NY; and Ngo (ed. 
1988) Non-isotopic Immunoassays Plenum Press, NY. 

Immunoassays for measurement of F06B09 proteins or 
3 5 peptides can be performed by a variety of methods known 
to those skilled in the art. In brief, immunoassays to 
measure the protein can be either competitive or 
noncompetitive binding assays. In competitive binding 
assays, the sample to be analyzed competes with a labeled 
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analyte for specific binding sites on a capture agent 
bound to a solid surface. Preferably the capture agent 
is an antibody specifically reactive with F06B09 proteins 
produced as described above. The concentration of 
5 labeled analyte bound to the capture agent is inversely 
proportional to the amount of free analyte present in the 

sample . e? 

In a competitive binding immunoassay, the F06B09 
protein present in the sample competes with labeled 
10 protein for binding to a specific binding agent, for 
example, an antibody specifically reactive with the 
F06B09 protein. The binding agent may be bound to a 
solid surface to effect separation of bound labeled 
protein from the unbound labeled protein. Alternately, 
15 the competitive binding assay may be conducted in liquid 
phase and a variety of techniques known in the art may be 
used to separate the bound labeled protein from the 
unbound labeled protein. Following separation, the 
amount of bound labeled protein is determined. The 
20 amount of protein present in the sample is inversely 

proportional to the amount of labeled protein binding. 

Alternatively, a homogeneous immunoassay may be 
performed in which a separation step is not needed. In 
these immunoassays, the label on the protein is altered 
25 by the binding of the protein to its specific binding 

agent. This alteration in the labeled protein results in 
a decrease or increase in the signal emitted by label, so 
that measurement of the label at the end of the 
immunoassay allows for detection or quantitation of the 
30 protein. 

F06B09 proteins may also be determined by a variety 
of noncompetitive immunoassay methods. For example, a 
two-site, solid phase sandwich immunoassay may be used, 
in this type of assay, a binding agent for the protein, 
35 for example an antibody, is attached to a solid support. 
A second protein binding agent, which may also be an 
antibody, and which binds the protein at a different 
site, is labeled. After binding at both sites on the 
protein has occurred, the unbound labeled binding agent 
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is removed and the amount of labeled binding agent bound 
to the solid phase is measured. The amount of labeled 
binding agent bound is directly proportional to the 
amount of protein in the sample. 
5 Western blot analysis can be used to determine the 

presence of F06B09 proteins in a sample. Electrophoresis 
is carried out, for example, on a tissue sample suspected 
of containing the protein. Following electrophoresis to 
separate the proteins, and transfer of the proteins to a 

10 suitable solid support, e.g., a nitrocellulose filter, 

the solid support is incubated with an antibody reactive 
with the protein. This antibody may be labeled, or 
alternatively may be detected by subsequent incubation 
with a second labeled antibody that binds the primary 

15 antibody. 

The immunoassay formats described above may employ 
labeled assay components. The label may be coupled 
directly or indirectly to the desired component of the 
assay according to methods well known in the art. A wide 

2 0 variety of labels and methods may be used. 

Traditionally, a radioactive label incorporating 3 H, 
125 I, 35 S, 14 C, or 32 P was used. Non-radioactive labels 
include ligands which bind to labeled antibodies, 
f luorophores , chemi luminescent agents, enzymes, and 

2 5 antibodies which can serve as specific binding pair 

members for a labeled ligand. The choice of label 
depends on sensitivity required, ease of conjugation with 
the compound, stability requirements, and available 
instrumentation. For a review of various labeling or 
30 signal producing systems which may be used, see U.S. 

Patent No. 4,391,904, which is incorporated herein by 
reference . 

Antibodies reactive with a particular protein can 
also be measured by a variety of immunoassay methods. 

3 5 For a review of immunological and immunoassay procedures 

applicable to the measurement of antibodies by 
immunoassay techniques, see Stites and Terr (eds.) Basic 
and Clinical Immunology (7th ed. ) supra; Maggio (ed.) 
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Enzyme immunoassay , supra; and Harlow and Lane 
' ^Hhodie? , a LaboratnryManual, supra. 

In brief, immunoassays to measure antisera reactive 
with F06B09 proteins can be either competitive or 
5 noncompetitive binding assays. In competitive binding 
assays, the sample analyte competes with a labeled 
analyte for specific binding sites on a capture agent 
bound to a solid surface. Preferably the capture agent 
is a purified recombinant F06B09 protein produced as 
10 described above. Other sources of F06B09 proteins, 
including isolated or partially purified naturally 
occurring protein, may also be used. Noncompetitive 
assays include sandwich assays, in which the sample 
analyte is bound between two analyte- spec if ic binding 
15 reagents . One of the binding agents is used as a capture 
agent and is bound to a solid surf ace . The second 
binding agent is labeled and is used to measure or detect 
the resultant complex by visual or instrument means. A 
number of combinations of capture agent and labeled 
20 binding agent can be used. A variety of different 

immunoassay formats, separation techniques, and labels 
can be also be used similar to those described above for 
the measurement of F06B09 proteins. Similar methods may 
be used to evaluate or quantitate specific binding 
25 compounds. 

VI. Purified F06B09 proteins 

Human F06B09 protein amino acid sequences are 
provided in Table 1 and SEQ ID NO: 2 or 4. 
30 ' Purified protein or defined peptides are useful for 

generating antibodies by standard methods, as described 
above. Synthetic peptides or purified protein can be 
presented to an immune system to generate polyclonal and 
monoclonal antibodies. See, e.g., Coligan (1991) Current 
35 PjoJ^soJLs in TTnmimolocrv Wiley/Greene, NY; and Harlow and 
Lane (1989) Ar^tibodiegj B t. Moratory Manual Cold Spring 
■ Harbor Press, NY, which are incorporated herein by 
reference. 
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- j The specific binding composition can be used for 

screening an expression library made from a cell line 
which expresses an F06B09 protein. Many methods for 
screening are available, e.g., standard staining of 
5 surface expressed ligand, or by panning. Screening of 

intracellular expression can also be performed by various 
staining or immunofluorescence procedures. The binding 
compositions could be used to affinity purify or sort out 
cells expressing the ligand. 

10 The peptide segments, along with comparison to 

homologous genes, can also be used to produce appropriate 
oligonucleotides to screen a library. The genetic code 
can be used to select appropriate oligonucleotides useful 
as probes for screening. In combination with polymerase 

15 chain reaction (PGR) techniques, synthetic 

oligonucleotides will be useful in selecting desired 
clones from a library, including natural allelic and 
polymorphic variants. 

The peptide sequences allow preparation of peptides 

20 to generate antibodies to recognize such segments, and 
allow preparation of oligonucleotides which encode such 
sequences . The sequence also allows for synthetic 
preparation, e.g., see Dawson, et al . (1994) Science 
2 66:77 6-779. Analysis of the structural features in 

25 comparison with the most closely related reported 

sequences has revealed similarities with other proteins, 
particularly the class of proteins known as proteases . 

VII . Physical Variants 

3 0 This invention also encompasses proteins or peptides 

having substantial amino acid sequence similarity with an 
amino acid sequence of an F06B09 protein. Natural 
variants include individual, polymorphic, allelic, 
strain, or species variants. Conservative substitutions 

3 5 in the amino acid sequence will normally preserve most 
relevant biological activities. In particular, various 
. substitutions can be made, e.g., embodiments with 10-fold 
substitutions, 7-fold substitutions, 5-fold 
substitutions, 3 -fold substitutions, 2-fold, and etc. 
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Such embodiments will typically retain particular 
features, e.g., antigenicity, with the natural forms. 

Amino acid sequence similarity, or sequence 
identity, is determined by optimizing residue matches, if 
5 necessary, by introducing gaps as required. This changes 
when considering conservative substitutions as matches. 
Conservative substitutions typically include 
substitutions within the following groups: glycine, 
alanine; valine, isoleucine, leucine; aspartic acid, 
10 glutamic acid; asparagine, glutamine; serine, threonine; 
lysine, arginine; and phenylalanine, tyrosine. 
Homologous amino acid sequences include natural 
polymorphic, allelic, and interspecies variations in each 
respective protein sequence. Typical homologous proteins 
15 or peptides will have from 50-100% similarity (if gaps 
can be introduced), to 75-100% similarity (if 
conservative substitutions are included) with the amino 
acid sequence of the F0 6B09 protein. Similarity measures 
will be at least about 50%, generally at least about 60%, 
20 more generally at least about 65%, usually at least about 
7 0%, more usually at least about 7 5%, preferably at least 
about 80%, and more preferably at least about 80%, and in 
particularly preferred embodiments, at least about 85% or 
more. See also Needleham, et al . (197 0) J. Mol . Biol. 
25 48:443-453; Sankoff, et al . (1983) Time Warps, String 
Edits, and Macromolecules: The Theory and Practice of 
Sequence nomparison Chapter One, Addis on -Wesley , Reading, 
MA; and software packages from IntelliGenetics , Mountain 
View, CA; and the University of Wisconsin Genetics 
30 Computer Group, Madison, WI . 

Natural nucleic acids encoding mammalian F0 6B09 
proteins will typically hybridize to the nucleic acid 
sequence of SEQ ID NO: 1 or 3 under stringent conditions. 
For example, nucleic acids encoding human F06B09 proteins 
3 5 will normally hybridize to the nucleic acid of SEQ ID NO: 
1 or 3 under stringent hybridization conditions. 
• Generally, stringent conditions are selected to be about 
10° C lower than the thermal melting point (Tm) for the 
probe sequence at a defined ionic strength and pH. The 
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Tm is the temperature (under defined ionic strength and 
pH) at which 5 0% of the target sequence hybridizes to a 
perfectly matched probe. Typically, stringent conditions 
will be those in which the salt concentration is about 
5 0.2 molar at pH 7 and the temperature is at least about 
50° C. Other factors may significantly affect the 
stringency of hybridization, including, among others, 
base composition and size of the complementary strands, 
the presence of organic solvents such as formamide, and 
10 the extent of base mismatching. A preferred embodiment 
will include nucleic acids which will bind to disclosed 
sequences in 50% formamide and 200 mM NaCl at 42° C. 
See, e.g., Sambrook, et al . 

An isolated F06B09 protein DNA can be readily 
15 modified by nucleotide substitutions, nucleotide 

deletions, nucleotide insertions, and short inversions of 
nucleotide stretches. These modifications result in 
novel DNA sequences which encode F06B09 protein antigens, 
their derivatives, or proteins having highly similar 
20 physiological, immunogenic, or antigenic activity. 

Modified sequences can be used to produce mutant 
antigens or to enhance expression. Enhanced expression 
may involve gene amplification, increased transcription, 
increased translation, and other mechanisms. Such mutant 
25 F0 6B09 protein derivatives include predetermined or site- 
specific mutations of the respective protein or its 
fragments. "Mutant F06B09 protein" encompasses a 
polypeptide otherwise falling within the homology 
definition of the human F06B09 protein as set forth 
3 0 above, but having an amino acid sequence which differs 
from that of an F06B09 protein as found in nature, 
whether by way of deletion, substitution, or insertion. 
In particular, "site specific mutant F06B09 protein" 
generally includes proteins having significant similarity 
35 with a protein having a sequence of SEQ ID NO: 2 or 4, 
and as sharing various biological activities, e.g., 
.antigenic or immunogenic , with those sequences, and in 
preferred embodiments contain most or all of the 
disclosed sequence. This applies also to polymorphic 
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variants from different individuals. Similar concepts 
apply to different F06B09 proteins, particularly those 
found in various warm blooded animals, e.g., mammals and 
birds. As stated before, it is emphasized that 
5 descriptions are generally meant to encompass other 
F06B09 proteins, not limited to the human embodiments 
specifically discussed. 

Although site specific mutation sites are 
predetermined, mutants need not be site specific. F06B09 

10 protein mutagenesis can be conducted by making amino acid 
insertions or deletions. Substitutions, deletions, 
insertions, or combinations may be generated to arrive at 
a final construct. Insertions include amino- or 
carboxyl- terminal fusions, e.g. epitope tags. Random 

15 mutagenesis can be conducted at a target codon and the 
expressed mutants can then be screened for the desired 
activity. Methods for making substitution mutations at 
predetermined, sites in DNA having a known sequence are 
well known in the art, e.g., by M13 primer mutagenesis or 

2 0 polymerase chain reaction (PCR) techniques. See also, 

Sambrook, et al . (1989) and Ausubel, et al . (19 87 and 
Supplements) . The mutations in the DNA normally should 
not place coding sequences out of reading frames and 
preferably will not create complementary regions that 
25 could hybridize to produce secondary mRNA structure such 
as loops or hairpins . 

The present invention also provides recombinant 
proteins, e.g., heterologous fusion proteins using 
segments from these proteins. A heterologous fusion 

3 0 protein is a fusion of proteins or segments which are 

naturally not normally fused in the same manner. Thus, 
the fusion product of an immunoglobulin with an F06B09 
protein polypeptide is a continuous protein molecule 
having sequences fused in a typical peptide linkage, 
3 5 typically made as a single translation product and 

exhibiting properties derived from each source peptide. 
■ A similar concept applies to heterologous nucleic acid 
sequences. Antibody fusion proteins are also 
contemplated . 
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In addition, new constructs may be made from 
combining similar functional domains from other proteins . 
For example, protein-binding or other segments may be 
"swapped" between different new fusion polypeptides or 
5 fragments. See, e.g., Cunningham, et al . (19 89) Science 
243:1330-1336; and O'Dowd, et al . (1988) J . Biol. Chem. 
263:15985-15992. Thus, new chimeric polypeptides 
exhibiting new combinations of specificities will result 
from the functional linkage of protein -binding 
10 specificities and other functional domains. 

VIII. Binding Agent:F06B09 Protein Complexes 

An F06B09 protein that specifically binds to or that 
is specif ically' immunoreactive with- an antibody generated 
15 against a defined immunogen, such as an immunogen 

consisting of the amino acid secjuence of SEQ ID NO: 2 or 
4, is typically determined in an immunoassay. The 
immunoassay uses a polyclonal antiserum which was raised 
to a protein of SEQ ID NO: 2 or 4. This antiserum is 
2 0 selected to have low crossreactivity against other 

proteases and any such crossreactivity is removed by 
immunoabsorption prior to use in the immunoassay. 

In order to produce antisera for use in an 
immunoassay, the protein of SEQ ID NO: 2 or 4 is isolated^ 
25 as described herein. For example, recombinant protein 
may be produced in a mammalian cell line. An inbred 
strain of mice such as Balb/c is immunized with the 
protein of SEQ ID NO: 2 or 4 using a standard adjuvant, 
such as Freund's adjuvant, and a standard mouse 
3 0 immunization protocol (see Harlow and Lane, supra) . 

Alternatively, a synthetic peptide, preferably near full 
length, derived from the sequences disclosed herein and 
conjugated to a carrier protein can be used an immunogen. 
Polyclonal sera are collected and titered against the 
3 5 immunogen protein in an immunoassay, for example, a solid 
phase immunoassay with the immunogen immobilized on a 
solid support. Polyclonal antisera with a titer of 10 4 
' or greater are selected and tested for their cross 
reactivity against other proteases, e.g., using a 
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competitive binding immunoassay such as the one described 
in Harlow and Lane, supra, at pages 570-573. Preferably 
two related proteins are used in this determination in 
conjunction with either F06B09 protein. 
5 Immunoassays in the competitive binding format can 

be used, for the crossreactivity determinations. For 
example > a protein of SEQ ID NO: 4 can be immobilized to 
a solid support. Proteins added to the assay compete 
with the binding of the antisera to .the immobilized 

10 antigen. The ability of the above proteins to compete 
with the binding of the antisera to the immobilized 
protein is compared to the protein of SEQ ID NO: 4. The 
percent crossreactivity for the above proteins is 
calculated, using standard calculations. Those antisera 

15 with less than 10% crossreactivity with each of the 
proteins listed above are selected and pooled. The 
cross -reacting antibodies are then removed from the 
pooled antisera by immunoabsorption with the above-listed 
proteins . 

20 The immunoabs orbed and pooled antisera are then used 

in a competitive binding immunoassay as described above 
to compare a second protein to the immunogen protein 
(e.g., the protein motif of SEQ ID NO: 4) . In order to 
make this comparison, the two proteins are each assayed 

25 at a wide range of concentrations and the amount of each 
protein required to inhibit 50% of the binding of the 
antisera to the immobilized protein is determined. If 
the amount of the second protein required is less than 
twice the amount of the protein of SEQ ID NO: 4 that is 

3 0 required, then the second protein is said to specifically 
bind to an antibody generated to the immunogen. 

It is understood that F06B09 proteins are families 
of homologous proteins that comprise two or more genes. 
For a particular gene product, such as the human F06B09 

35 proteins, the term refers not only to the amino acid 

sequences disclosed herein, but also to other proteins 
. that are polymorphic, allelic, non-allelic, or species 
variants or equivalents. It is also understood that the 
term "human F06B09 protein" includes equivalent proteins, 
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-e.g.., nonnatural mutations introduced by deliberate 
mutation using conventional recombinant technology such 
as single site mutation, or by excising short sections of 
DNA encoding F06B09 proteins, or by substituting new 

5 amino acids, or adding new amino acids. Such minor 
alterations must substantially maintain the 
immuno identity of the original molecule 'and/or its 
biological activity. Thus, these alterations include 
proteins that are specifically immuno reactive with a 

10 designated naturally occurring F06B09 protein, for 

example, the human F06B09 protein shown in SEQ ID NO: 4. 
The biological properties of the altered proteins can be 
determined by. expressing the protein in an appropriate 
cell line and measuring, e.g., enzymatic activity under 

15 appropriate conditions. Particular protein modifications 
considered minor would include conservative substitution 
of amino acids with similar chemical properties, as 
described above for F06B09 protein families as a whole. 
By aligning a protein optimally with the protein of SEQ 

20 ID NO: 4, and by using the conventional immunoassays 

described herein to determine immunoidentity, or by using 
lymphocyte chemotaxis assays, one can determine the 
protein compositions' of the invention. 

25 IX. Functional Variants 

The blocking of physiological response to F06B09 
protein may result from the inhibition of enzymatic 
activity of the protein against its substrate, e.g. , 
through competitive inhibition. Thus, in vitro assays of 
3 0 the present invention will often use isolated protein, 
membranes from cells expressing a recombinant membrane 
associated proteins, soluble fragments comprising 
enzymatically active segments of these proteins, or ■ 
fragments attached to solid phase substrates. These 
3 5 assays will also allow for the diagnostic determination 
of the effects of either binding segment mutations and 
•modifications, or protein mutations and modifications, 
e.g., protein analogs. This invention also contemplates 
the use of competitive drug screening assays, e.g., where 
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neutralizing antibodies to antigen or enzyme" fragments 
compete with a test compound for binding to the protein. 
In this manner, the antibodies can be used to detect the 
presence of a polypeptide which shares one or more 
5 antigenic binding sites of the protein and can also be 
used to occupy binding sites on the protein that might 
otherwise interact with, e.g., substrate. 

"Derivatives" of F06B09 proteins include amino acid 
sequence mutants, glycosylation variants, and covalent or 
10 aggregate conjugates with other chemical moieties. 
Covalent derivatives can be prepared by linkage of 
functionalities to groups which are found in F0 6B09 
protein amino acid side chains or at the N- or C- 
termini, e.g., by means which are well known in the art. 
15 These derivatives can include, without limitation, 

aliphatic esters or amides of the carboxyl terminus, or 
of residues containing carboxyl side chains, O-acyl 
derivatives of hydroxyl group-containing residues, and N- 
acyl derivatives of the amino terminal amino acid or 

2 0 amino-group containing residues, e.g., lysine or 

arginine. Acyl groups are typically selected from the 
group of alkyl-moieties including, e.g., C3 to C18 normal 
alkyl, thereby forming alkanoyl aroyl species. Covalent 
attachment to carrier proteins may be important when 
25 immunogenic moieties are haptens. 

In particular, glycosylation alterations are 
included, e.g., made by modifying the glycosylation 
patterns of a polypeptide during its synthesis and 
processing, or in further processing steps. Particularly 

3 0 preferred means for accomplishing this are by exposing 

the polypeptide to glycosylating enzymes derived from 
cells which normally provide such processing, e.g., 
mammalian glycosylation enzymes . Deglycosylation enzymes 
are also contemplated. Also embraced are versions of the 
35 same primary amino acid sequence which have other minor 
modifications, including phosphorylated amino acid 
-residues, e.g., phosphotyrosine, phosphoserine, or 
phospho threonine, or other moieties, including ribosyl 
groups or cross-linking reagents. 
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A major group of derivatives are covalent conjugates, 
of the F06B09 protein or fragments thereof with other 
proteins or polypeptides. These derivatives can be 
synthesized in recombinant culture such as N- or C- 
5 terminal fusions or by the use of agents known in the art 
for their usefulness in cross-linking proteins through 
reactive side groups. Preferred protein derivatization 
sites with cross-linking agents are at free amino groups, 
carbohydrate moieties, and cysteine residues. 

10 Fusion polypeptides between human F06B09 proteins 

and other homologous or heterologous proteins are also 
provided. Heterologous polypeptides may be fusions 
between different related proteins, resulting in, e.g., a 
hybrid protein exhibiting modified substrate or other 

15 binding specificity. Likewise, heterologous fusions may 
be constructed which would exhibit a combination of 
properties or activities of the derivative proteins. 
Typical .examples are fusions of a reporter polypeptide, 
e.g.. , luciferase, with a segment or domain of a protein, 

2 0 e.g., a receptor-binding segment, so that the presence or 
location of the fused protein may be easily determined. 
See, e.g., Dull, etal., U.S. Patent No. 4,859,609. 
Other gene fusion partners include bacterial (i- 

galactosidase, trpE, Protein A, ^-lactamase, alpha 
25 amylase, alcohol dehydrogenase, and yeast alpha mating 
factor. See, e.g., Godowski, et 'al. (1988) Science 
241:812-816. 

Such polypeptides may also have amino acid residues 
which have been chemically modified by phosphorylation, 

30 sulf onation, biotinylation, or the addition or removal of 
other moieties, particularly those which have molecular 
shapes similar to phosphate groups. In some embodiments, 
the modifications will be useful labeling reagents, or 
serve as purification targets, e.g., affinity ligands. 

35 This invention also contemplates the use of 

derivatives of F06B09 proteins other than variations in 
amino acid secjuence or glycosylation . Such derivatives 
may involve covalent or aggregative association with 
chemical moieties. These derivatives generally include 
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the three classes: (1) salts, (2) side chain and terminal 
residue covalent modifications, and (3) adsorption 
complexes, for example with cell membranes. Such 
covalent or aggregative derivatives are useful as 
5 immunogens, as reagents in immunoassays, or in 

purification methods such as for affinity purification of 
ligands or other binding ligands. For example, an F06B09 
protein can be immobilized by covalent bonding to a solid 
support such as cyanogen bromide-activated SEPHAROSE, by 
10 methods which are well known in the art, or adsorbed onto 
polyolefin surfaces, with or without glutaraldehyde 
cross-linking, for use in the assay or purification of 
anti-F06B09 protein antibodies. The F06B09 proteins can 
also be labeled with a detectable group, e.g., 
15 radioiodinated by the chloramine T procedure, covalently 
bound to rare earth chelates, or conjugated to another 
fluorescent moiety for use in diagnostic assays. 
Purification of F06B09 proteins may be effected by 
immobilized antibodies or substrate. 
20 Isolated F06B09 protein genes will allow 

transformation of cells lacking expression of 
corresponding F0 6B09 proteins, e.g., either species types 
or cells which lack corresponding proteins and exhibit 
negative background activity. Expression of transformed 
25 genes will allow isolation of antigenically pure cell 
lines, with defined or single specie variants. This 
approach will allow for more sensitive detection and 
discrimination of the physiological effects of F06B09 
protein substrate proteins. Subcellular fragments, e.g., 
30 cytoplasts or membrane fragments, can be isolated and 
used. 

X. Uses 

The present invention provides reagents which will 
35 find use in diagnostic applications as described 

elsewhere herein, e.g., in the general description for 
■ metabolic abnormalities, or below in the description of 
kits for diagnosis. 
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F06B09 protein nucleotides, e.g., human F06B09 
protein DNA or RNA, may be used as a component in a 
forensic assay. For instance, the nucleotide sequences 
provided may be labeled using, e.g., 32p or biotin and 
5 used to probe standard restriction fragment polymorphism 
blots, providing a measurable character to aid in 
distinguishing between individuals. Such probes may be 
used in well-known forensic techniques such as genetic 
fingerprinting. In addition, nucleotide probes made from 
10 F06B09 protein sequences may be used in in situ assays to 
detect chromosomal abnormalities. 

Antibodies and other binding agents directed towards 
F06B09 • proteins or nucleic acids may be used to purify 
the corresponding F06B09 protein molecule. As described 
15 in the Examples below, antibody purification of F06B09 
protein components is both possible and practicable. 
Antibodies and other binding agents may also be used in a 
diagnostic fashion to determine whether F0 6B09 protein 
components are present in a tissue sample or cell, 
2 0 population using well-known techniques described herein. 
The ability to attach a binding agent to an F06B09 
protein provides a means to diagnose disorders associated 
with F06B09 protein misregulation. Antibodies and other 
F06B09 protein binding agents may also be useful as 
25 histological or sorting markers. As described in the 

examples below, F06B09 protein expression is limited to 
specific tissue types. By directing a probe, such as an 
antibody or nucleic acid to an F06B09 protein, it is 
possible to use the probe to distinguish tissue and cell 
30 types in situ or in vitro. 

This invention also provides reagents with 
significant therapeutic value. The F06B09 protein 
(naturally occurring or recombinant) , fragments thereof, 
and antibodies thereto, along with compounds identified 
3 5 as having binding affinity to an F06B09 protein, are 
useful in the treatment of conditions associated with 
abnormal metabolism, physiology/ or development, 
including abnormal immune responsiveness or non- 
responsiveness . Abnormal proliferation, regeneration, 
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degeneration, and atrophy may be modulated by appropriate 
therapeutic treatment using the compositions provided 
herein. The F06B09 proteins likely play roles in 
regulation or development of hematopoietic cells, e.g., 
5 lymphoid cells, which affect immunological responses. 
Thus, for example, an antagonist of an F06B09 protein 
could be useful in blocking the conversion of an immature 
or inactive immunologically relevant pro-protein to the 
mature or active form. Since the F06B09 proteases were 

10 derived from dendritic cells, antagonists could also be 
important in preventing antigen processing and/ or 
subsequent presentation. In addition, effects on DC 
migration or .dendrite extension between cells may result. 
One potential therapeutic application of F06B09 would be 

15 to block this protease in inflammatory processes 

involving the dendritic cells (DC) . The blocking could 
occur on the F0 6B09 MMP itself or on other molecules 
interacting with it. 

Other abnormal developmental conditions are known in 

2 0 cell types shown to possess F0 6B09 protein encoding mRNA 

by northern blot analysis. See Berkow (ed.) The Merck 
Manual of Diagnosis and Therar>v, Merck & Co., Rahway, N J ; 
and Thorn, et al . Harrison's Pri nciples of Internal 
Medicine, McGraw-Hill, NY. Developmental or functional 
25 abnormalities, e.g., of the immune system, cause 

significant medical abnormalities and conditions which 
may be susceptible to prevention or treatment using 
compositions provided herein. 

Recombinant F0 6B09 protein antibodies can be 

3 0 purified and then administered to a patient. These 

reagents can be combined for therapeutic use with 
additional active or inert ingredients, e.g., in 
conventional pharmaceutically acceptable carriers or 
diluents, e.g., immunogenic adjuvants, along with 
35 physiologically innocuous stabilizers and excipients . 
These combinations can be sterile filtered and placed 
. into dosage forms as by lyophilization in dosage vials or 
storage in stabilized aqueous preparations. This 
invention also contemplates use of antibodies or binding 
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fragments thereof, including forms which are not 
complement binding. 

Drug screening using antibodies or fragments thereof 
can identify compounds having binding affinity to F06B09 
5 protein, including isolation of associated components. 

Various substrate candidates can be screened. Subsequent 
biological assays can then be utilized to determine if 
the compound has intrinsic enzyme blocking activity. 
Likewise, a compound having intrinsic stimulating 
10 activity might activate the activity of an F06B09 
protein. This invention further contemplates the 
therapeutic use of antibodies to FG6B09 protein as 
antagonists . This approach- should be particularly useful 
with other F06B09 protein polymorphic or species 
15 variants. 

The quantities of reagents necessary for effective 
therapy will depend upon many different factors, 
including means of administration, target site, 
physiological state of the patient, and other medicants 
2 0 administered. Thus, treatment dosages should be titrated 
to optimize safety and efficacy. Typically, dosages 
used in vitro may provide useful guidance in the amounts 
useful for in situ administration of these reagents. 
Animal testing of effective doses for treatment of 
2 5 particular disorders will provide further predictive 

indication of human dosage. Various considerations are 
described, e.g., in Gilman,- et al. (eds. 1990) Goodman 
and Gilman's: The Pharmacological Bases of Therapeutics 
(8th ed.) Pergamon Press; and (1990) Remington ' s 
30 Pharmaceutical Sciences (17th ed.) Mack Publishing Co., 
Easton, PA. Methods for administration are discussed 
therein and below, e.g., for oral, intravenous, 
intraperitoneal, or intramuscular administration, 
transdermal diffusion, and others. Pharmaceutical ly 
35 acceptable carriers will include water, saline, buffers, 
and other compounds described, e.g., in the Merck Index, 
Merck & Co . , Rahway, NJ. Dosage ranges would ordinarily 
be expected to be in amounts lower than 1 mM 
concentrations, typically less than about 10 |iM 
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concentrations, usually less than about 100 nM, 
preferably less than about 10 pM (picomolar) , and most 
preferably less than about 1 fM ( f emtomolar ) , with an 
appropriate carrier. Slow release formulations, or a 
5 slow release apparatus will often be utilized for 
continuous administration. 

F06B09 proteins, fragments thereof; antibodies to it 
or its fragments; antagonists; and agonists, may be 
administered directly to the host to be treated or, 
10 depending on the size of the compounds , it may be 

desirable to conjugate them to carrier proteins such as 
ovalbumin or serum albumin prior to their administration. 
Therapeutic formulations may be administered in any 
conventional dosage formulation. While it is possible" 
15 for the active ingredient to be administered alone, it is 
preferable to present it as a pharmaceutical formulation. 
Formulations typically comprise at least one active 
ingredient, as defined above, together with one or more 
acceptable carriers thereof. Each carrier should be both 
2 0 pharmaceutical^ and physiologically acceptable in the 

sense of being compatible with the other ingredients and 
not injurious to the patient. Formulations include those 
suitable for oral, rectal, nasal, or parenteral 
(including subcutaneous, intramuscular, intravenous and 
25 intradermal) administration. The formulations may 

conveniently be presented in unit dosage form and may be 
prepared by any methods well known in the art of 
pharmacy. See, e.g., Gilman, et al . (eds. 1990) Goodman 
and Gilman's: The Pharmacologi cal Bases of Therapeutics 
3 0 (8th ed.) Pergamon Press; and (199 0) Remington ' s 

Pharmaceutical Sciences (17th ed. ) Mack Publishing Co., 
Easton, PA; Avis, et al . (eds. 1993) Pharmaceutical 
Dosage Forms: Parentera l Medications Dekker, NY; 
Lieberman, et al. (eds. 1990) Pharmaceutical Dosage 
3 5 Forms: Tablets Dekker, NY; and Lieberman, et al . (eds. 
1990) Pharmaceutical Dosage Forms: Disperse Systems 
•Dekker, NY. The therapy of this invention may be 
combined with or used in association with other 
therapeutic agents'. 
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Both the naturally occurring and the recombinant 
forms of the F06B09 proteins of this invention are 
particularly useful in kits and assay methods which are 
capable of screening compounds for binding activity to 
5 the proteins, including substrates or competitive 

inhibitors . Several methods of automating assays- have 
been developed in recent years so as to permit screening 
of tens of thousands of compounds in a short period. 
See, e.g., Fodor, et al . (1991) Science 251:767-773, and 
10 other descriptions of chemical diversity libraries, which 
describe means for testing of binding affinity by a 
plurality of compounds. The development of suitable 
assays can be greatly facilitated by the availability of 
large amounts of purified, soluble F06B09 protein as 
15 provided by this invention. 

For example, antagonists or inhibitors can normally 
be found once the protein has been structurally defined. 
Testing of potential substrates or analogs is now 
possible upon the development of highly automated assay 
20 methods using a purified enzyme. In particular, new 
agonists and antagonists will be discovered by using 
screening techniques described herein. Of particular 
importance are compounds found to have a combined 
blockage activity for multiple F06B09 protein substrates, 
25 e.g., compounds which can serve as antagonists for 

polymorphic or species variants of an F06B09 protein. 
Inhibitors can be identified, which may be useful as 
therapeutic entities. 

This invention is particularly useful for screening 
30 compounds by using recombinant protein in a variety of 
drug screening techniques. The advantages of using a 
recombinant protein in screening for specific inhibitors 
include: (a) improved renewable source of the F06B09 
protein from a specific source; (b) potentially greater 
3 5 number of molecules per cell giving better signal to 
noise ratio in assays; and (c) species variant 
specificity (theoretically giving greater biological and 
disease specificity) . 
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One method of drug screening utilizes eukaryotic or 
prokaryotic host cells which are stably transformed with 
recombinant DNA molecules expressing an F06B09 protein 
substrate. Cells may be isolated which express a 
5 substrate in isolation from any others. Such cells, 

either in viable or fixed form, can be used for standard 
enzyme/substrate cleavage assays. See also, Parce , et 
al. (1989) Science 246:243-247; and Owicki, et al . (1990) 
Proc . Nat ; 1 Acad . Sci . USA 87:4007-4011, which describe 

10 sensitive methods to detect cellular responses. 

Competitive assays are particularly useful, where the 
cells (source of F06B09 protein) or homogenates are 
contacted and incubated with a labeled antibody having 
known binding affinity to the protein, such as 125j_ 

15 antibody, and a test sample whose binding affinity to the 
binding composition is being measured. The bound and 
free labeled binding compositions are then separated to 
assess the degree of antigen binding. The amount of test 
compound bound is inversely proportional to the amount of 

20 labeled reagent binding to the known source. Any one of 
numerous techniques can be used to separate bound from 
free antigen to assess the degree of ligand binding. 
This separation step could typically involve a procedure 
such as adhesion to filters followed by washing, adhesion. 

25 to plastic followed by washing, or centrifugation of the 
cell membranes. Viable cells could also be used to 
screen for the effects of drugs on F06B09 protein 
mediated functions, e.g., proprotein activation, 
substrate cleavage, and others. Some detection methods 

30 allow for elimination of a separation step, e.g., a 
proximity sensitive detection system. 

Another method utilizes solubilized, unpurified or 
solubilized, purified F06B09 protein from transformed 
eukaryotic or prokaryotic host cells. This allows for a 

35 "molecular" binding assay with the advantages of 

increased specificity, the ability to automate, and high 
.drug test throughput. 

Another technique for drug screening involves an 
approach which provides high throughput screening for 
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compounds having suitable binding affinity to an F06B09 
J protein, e.g., an antibody, is described in detail in 
Geysen, European Patent Application 84/03564, published 
on September 13, 1984. First, large numbers of different 

5 small peptide test compounds are synthesized on a solid 
substrate, e.g., plastic pins or some other appropriate 
surface, see Fodor, et al . , supra. Then all the pins are 
reacted with solubilized, unpurified or solubilized, 
purified F06B09 protein antibody, and washed. The next 

10 step involves detecting bound F06B09 protein antibody. 
Rational drug design may also be based upon 
structural studies of the molecular shapes of the F06B09 
protein and other effectors or analogs. See, e.g., 
Methods in Enzvmoloav vols. 2 02 and 2 03. Effectors may 

15 be other proteins which mediate other functions in 

response to antigen binding, or other proteins which 
normally interact with the substrate. One means for 
determining which sites interact with specific other 
proteins is a physical structure determination, e.g., x- 

20 ray crystallography or 2 dimensional NMR techniques. 
These will provide guidance as to which amino acid 
residues form molecular contact regions. For a detailed 
description of protein structural determination, see, 
e.g., Blundell and Johnson (1976) Protein Crystallography 

25 Academic Press, NY. 

A purified F06B09 protein can be coated directly 
onto plates for use in the aforementioned drug screening 
techniques. However, non-neutralizing antibodies to 
■ these antigens can be used as capture antibodies to 
30 immobilize the respective antigen on the solid phase. 

Candidates for screening include for hybridomas, to find 
clones with desired binding specificity, or for 
inhibitors, e.g., of enzymatic activity. 

3 5 XI. Kits 

This invention also contemplates use. of F06B09 
proteins, fragments thereof, peptides, and their fusion 
' products in a variety of diagnostic kits and methods for 
detecting the presence of F0 6B09 protein or an F06B09 
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protein substrate. Typically the kit will have a 
compartment containing either a defined F0 6B09 peptide or 
gene segment or a reagent which recognizes one or the 
. other, e.g., substrates or antibodies. 
5 A kit for determining the binding affinity of a test 

compound to an F06B09 protein would typically comprise a 
test compound; a labeled compound, e.g., an antibody 
having known binding affinity for the F06B09 protein; a 
source of F06B09 protein (naturally occurring or 
10 recombinant) ; and a means for separating bound from free 
labeled compound, such as a solid phase for immobilizing 
the F06B09 protein. Once compounds are screened, those 
having suitable binding affinity to the F06B09 protein 
can.be evaluated in suitable biological assays, as. are 
15 well known in the art, to determine whether they act as 

agonists or antagonists to a substrate. The availability 
of recombinant F06B09 polypeptides also provide well 
def ined standards for calibrating such assays. 

A preferred kit for determining the concentration 
20 of, for example, an F06B09 protein in a sample would 

typically comprise a labeled compound, e.g., antibody, 
having known binding affinity for the F06B09 protein, a 
source of F0 6B09 protein (naturally occurring or 
recombinant) , and a means for separating the bound from 
25 free labeled compound, for example, a solid phase for 

immobilizing the F06B09 protein. * Compartments containing 
reagents, and instructions, will normally be provided. 

Antibodies, including antigen binding fragments, 
specific for the F06B09 protein, or fragments thereof, 
3 0 are useful in diagnostic applications to detect the 

presence of elevated levels of F06B09 protein and/or its 
fragments. Such diagnostic assays can employ lysates, 
live cells, fixed cells, immunofluorescence, cell 
cultures, body fluids, and further can involve the 
3 5 detection of antigens related to the ligand in serum, or 
the like. Diagnostic assays may be homogeneous (without 
a separation step between free reagent and F06B09 protein 
complex) or heterogeneous (with a separation step) . 
Various commercial assays exist, such as radioimmunoassay 
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(RIA) , enzyme- linked immunosorbent assay (ELISA) , enzyme 
immunoassay (EIA) , enzyme-multiplied immunoassay 
technique (EMIT) , substrate-labeled fluorescent 
immunoassay (SLFIA) , and the like. For example, 
5 unlabeled antibodies can be employed by using a second 
antibody which is labeled and which recognizes the 
antibody to an F0 6B09 protein or to a particular fragment 
thereof. Similar assays have also been extensively 
discussed in the literature. See, e.g., Harlow and Lane 

10 (1988) Antibodies: A Laboratory Manual , CSH Press, NY; 

Chan (ed. 1987.) Immunoassay: A Practical Guide Academic 
Press, Orlando, FL; Price and Newman "(eds. 1991) 
Principles and Practice of Immunoassay Stockton Press, 
NY; and Ngo (ed. 19 88) Nonisotooic Immunoassay Plenum 

15 Press, NY. 

Anti-idiotypic antibodies may have similar use to 
diagnose presence of antibodies against an F06B09 
protein, as such may be diagnostic of various abnormal 
states. "For example, overproduction of F06B09 protein 

2 0 may result in production of various immunological or 

other medical reactions which may be diagnostic of 
abnormal physiological states, e.g., in cell growth, 
activation, or differentiation. 

Frequently, the reagents for diagnostic assays are 
25 supplied in kits, so as to optimize the sensitivity of 

the assay. For the subject invention, depending upon the 
nature of the assay, the protocol, and the label, either 
labeled or unlabeled antibody, or labeled F0 6B09 protein 
is provided. This is usually in conjunction with other 

3 0 additives, such as buffers, stabilizers, materials 

necessary for signal production such as substrates for 
enzymes, and the like. Preferably, the kit will also 
contain instructions for proper use and disposal of the 
contents after use. Typically the kit has compartments 
3 5 for each useful reagent. Desirably, the reagents are 

provided as a dry lyophilized powder, where the reagents 
may be reconstituted in an aqueous medium providing 
appropriate concentrations of reagents for performing the 
assay. 
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Many of the aforementioned constituents of the drug 
screening and the diagnostic assays may be used without 
modification, or may be modified in a variety of ways. 
For example, labeling may be achieved by covalently or 
5 non-covalently joining a moiety which directly or 

indirectly provides a detectable signal. In any .of these 
assays, the protein, test compound, F06B09 protein, or 
antibodies thereto can be labeled either directly or 
indirectly. Possibilities for direct labeling include 

10 label groups: radiolabels such as 1 25j / enzymes (U.S. 
Pat. No. 3,645,090) such as peroxidase and alkaline 
phosphatase, and fluorescent labels (U.S. Pat. No. 
3,940,475) capable of monitoring the change in 
fluorescence intensity, wavelength shift, or fluorescence 

15 polarization. Possibilities for indirect labeling 

include biotinylation of one constituent followed by 
binding to avidin coupled to one of the above label 
groups . . 

There are also numerous methods of separating the 

2.0 bound from the free antigen, or alternatively the bound 
from the free test compound. The F06B09 protein can be 
immobilized on various matrices followed by washing. 
Suitable matrices include plastic such as an ELISA plate, 
filters, and beads. Methods of immobilizing the F06B09 

2 5 protein to a matrix include, without limitation, direct 
adhesion to plastic, use of a capture antibody, chemical 
coupling, and biotin-avidin. The last step in this 
approach usually involves the precipitation of 
enzyme /antibody or enzyme substrate complex by various 

30 methods including those utilizing, e.g., an organic 

solvent such as polyethylene glycol or a salt such as 
ammonium sulfate. Other suitable separation techniques 
include, without limitation, the fluorescein antibody 
magnetizable particle method described in Rattle, et al . 

35 (1984) Clin. Chem. 3 0:1457-1461, and the double antibody 
magnetic particle separation as described in U.S. Pat. 
•No. 4,659,678. 

Methods for linking proteins or their fragments to 
the various labels have been extensively reported in the 



WO 99/35276 PCT/US98/26214 

.literature and do not require detailed discussion here. 
Many of the techniques involve the use of activated 
carboxyl groups either through the use of carbodiimide or 
active esters to form peptide bonds, the formation of 

5 thioethers by reaction of a mercapto group with an 

activated halogen such as chloroacetyl , or an activated 
olefin such as maleimide, for linkage, or the like. 
Fusion proteins will also find use in these applications. 
Another diagnostic aspect of this invention involves 

10 use of oligonucleotide or polynucleotide sequences taken 
from the sequence of an F06B09 protein. These sequences 
can be used as probes for detecting levels of the F06B09 
protein message in samples from natural sources, or 
patients suspected of having an abnormal condition, e.g., 

15 immune problem. The preparation of both RNA and DNA 

nucleotide sequences, the labeling of the sequences, and 
the preferred size of the sequences has received ample 
description and discussion in the literature. Normally 
an oligonucleotide probe should have at least' about 14 

2 0 nucleotides, usually at least about 18 nucleotides, and 

the polynucleotide probes may be up to several kilobases . 
Various detectable labels may be employed, most commonly 
radionuclides, particularly 32 P. However, other 
techniques may also be employed, such as using biotin 

2 5 modified nucleotides for introduction into a 

polynucleotide. The biotin then -serves as the site for 
binding to avidin or antibodies, which may be labeled 
* with a wide variety of labels, such as radionuclides, 
f luorophores , enzymes, or the like. Alternatively, 

3 0 antibodies may be employed which can recognize specific 

duplexes, including DNA duplexes, RNA duplexes, DNA- RNA 
hybrid duplexes, or DNA-protein duplexes. The antibodies 
in turn may be labeled and the assay carried out where 
the duplex is bound to a surface, so that upon the 
35 formation of duplex on the surface, the presence of 

antibody bound to the duplex can be detected. The use of 
probes to the novel anti-sense RNA may be carried out 
using many conventional techniques such as nucleic acid 
hybridization, plus and minus screening, recombinational 



WO 99/35276 PCT/US98/26214 

probing, hybrid released translation (HRT) , and hybrid 
arrested translation (HART) . This also includes 
amplification techniques such as polymerase chain 
reaction (PCR) . 

5 Diagnostic kits which also test for the qualitative 

or quantitative presence of other markers are also 
contemplated. Diagnosis or. prognosis may depend on the 
combination of multiple indications used as markers. 
Thus, kits may test for combinations of markers. See, 
10 e.g., Viallet, et al . (1989) Progress in Growth Factor 
Res. 1:89-97. 



XII. Substrate Identification 

Having isolated a protease, methods exist for 
15 identifying a target substrate. For example, a candidate 
substrate can be contacted with an F0 6B09 protein in an 
enzymatic reaction. The resulting cleavage or product 
can be analyzed, e.g., using SDS-PAGE, HPLC, spectroscopy 
• or other forms of analysis. For example, the molecular 
20 weight of a protease cleavage product should be compared 
against the molecular weights of the uncleaved substrate 
and the F06B09 protein. The successful candidate 
substrate will exhibit a shift to a lower molecular 
weight. Analysis of the substrate should determine what 
25 site specificity may exist for the enzyme under the 

tested conditions. Alternatively, if the protease acts 
by transforming an inactive substrate to the active form, 
the resulting activity can be assayed, e.g., by the 
result of the activated factor, e.g., proliferation, 
3 0 apoptosis, or activation of a target cell. 

Sequence specificity of products may allow search 
through sequence databases to identify candidate proteins 
as physiologically natural substrates. Alternatively, 
the protease may be involve in antigen processing and 
3 5 presentation to appropriate' immune cells. 

The broad scope of this invention is best understood 
• with reference to the following examples, which are not 
intended to limit the invention to specific embodiments. 
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EXAMPLES 

I . General Methods 

Many of the standard methods below are described or 
5 referenced, e.g., in Maniatis, et al . (1982) Molecular 
Cloning, A Laboratory Manual Cold Spring Harbor 
Laboratory, Cold Spring Harbor Press, NY; Sambrook, et 
. al. (1989) Molecular Clonin g: A Laboratory Manual (2d 
ed. ) Vols. 1-3,' CSH Press, NY; Ausubel, et al . , Biology 
10 Greene Publishing Associates, Brooklyn, NY; or Ausubel, 
et al. (1987 and Supplements) Current Protocols in 
Molecular Biology Wiley /Greene , NY; Irmis, et al . (eds. 
1990) PCR Protocols: A Guide to Methods and Applications 
Academic Press, NY. Methods for protein purification 
15 include such methods as ammonium sulfate precipitation, 
column chromatography, electrophoresis , centrif ugation, 
crystallization, and others. See, e.g., Ausubel, et al. 
(1987 and periodic supplements) ; Deutscher (199 0) "Guide 
to Protein Purification," Methods in Enzvmolocrv vol. 182, 

2 0 and other volumes in this series; and manufacturer's 

literature on use of protein purification products, e.g., 
Pharmacia, Piscataway, NJ, or Bio-Rad, Richmond, CA. 
Combination with recombinant techniques allow fusion to 
appropriate segments (epitope tags), e.g., to a FLAG 
25 sequence or an equivalent which can be fused, e.g., via a 
protein-removable sequence. See; e.g., Hochuli (1989) 
Chemische Industrie 12:69-70; Hochuli (1990) 
"Purification of Recombinant Proteins with Metal Chelate 
Absorbent" in Setlow (ed.) Genetic Engineering, Principle 

3 0 and Methods 12:87-98, Plenum Press, NY; and Crowe, et al . 

(1992) OIAexoress: The High Level Expression & Protein 
Purification System QUIAGEN, Inc., Chatsworth, CA. 

Standard immunological techniques are described, 
e.g., in Hertzenberg, et al . (eds. 1996) Weir ' s Handbook 
3 5 of Experimental Immunology vols. 1-4, Blackwell Science; 
Coligan (1991) Current Protoc ols in Immunology 
•Wiley/Greene, NY; and Methods in Enzymology volumes. 70, 
73, 74, 84, 92, 93, 108, 116, 121, 132, 150, 162, and 
163 . Assays for neural cell biological activities are 
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described, e.g., in Wouterlood (ed. 1995) Neuroscience 
Protocols modules 10,' Elsevier; Methods in Neurosciences 
Academic Press; and Neuromethods Humana Press, Totowa, 
NJ. Methodology of developmental systems is described, 
5 e.g., in Meisami (ed.) Handbook of Human Growth and 
Developmental Bioloav CRC Press; and Chrispeels (ed.) 
Molecular Techniques and Approaches in Developmental 
Biology Interscience . 

FACS analyses are described in Melained, et al . 
10 (1990) Flow Cytometry and Sorting Wiley-Liss, Inc., New 
York, NY; Shapiro (1988) Practica l Flow Cytometry Liss, 
New York, NY; and Robinson, et al . (1993) Handbook of 
Flow Cytometry Methods Wi ley-Li's s, New York, NY. 

15 II. Hematopoietic factors, cells, and cell lines 
rhGM-CSF (specific activity : 2 x 10 6 U/mg; 
Schering-Plough Research Institute, Kenilworth, NJ) was 
used at a saturating concentration of 100 ng/ml (2 00 
U/ml) . rhTNFCX (specific activity : 2 x 10 7 U/mg; 

2 0 Genzyme, Boston, MA) was used at an optimal concentration 
of 2.5 ng/ml (50 U/ml). rhSCF (specific activity : 4 x 
105 U/mg; R&D, Abington, U.K.) and rhM-CSF (specific 
activity : 2 x 10 6 U/mg; R&D) were used at optimal 
concentration of 25 ng/ml. rhG-CSF (ED50: 0.01-0.03 

25 ng/ml R&D) was used at an optimal concentration of 25 
ng/ml. 

Peripheral blood mononuclear cells (PBMC) were 
obtained from healthy donors after Ficoll-Hypaque 
gradient centrif ugation (d=1.077; Eurobio, Paris, 

30 France) . T cells were purified from PBMC by 

immunomagnetic depletion (Dynal, Oslo, Norway) using a 
cocktail of mAbs (CD14, CD16, CD35, HLA-DR, (Immunotech, 
Marseille, France), CD19 (ascites), NKH1 (Coulter, 
Hialeah, FL) , CD40 (mAb 89 produced in the laboratory) ) . 

35 The purity of CD3+ T cells was higher than 95%. T cells 
were activated with coated anti-CD3 and soluble anti-CD2 8 
■ mAbs for 3, 12 and 24 h. B cells were obtained from 
human tonsils as described (Liu, et al . (1996) Immunity . 
4:603-613) . Briefly, T cells were first depleted by 
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, rosetting sheep red blood cells and then the residual 
non-B cells were removed by immunomagnetic depletion . 
using a cocktail of mAbs (CD2 , CD3 , CD4 , CD14, CD16, 
NKH1, CD35) . The purity of CD19 + B cells was higher than 
5 98%. Langerhans cells were prepared from normal skin by 
CDla positive selection as described (Le Varlet, .et al. 
(1992) J . Leukoc . Biol. 51:415-420). Granulocytes were 
generated in vitro from CD34 + progenitors in the presence 
of G-CSF and SCF for 12 days . Macrophages were generated 

10 in vitro by culturing human cord blood CD3 4+ progenitors 
with M-CSF and SCF for 12 days (Szabolcs, et al . (1996) 
Blood . 87:4520-4530). Cells were unactivated or 
activated by PMA-ionomycin for 1 and 6 h (PMA: 1 ng/xnl, 
Sigma, St. Louis, MO; Ionomycin: 1 pg /ml, Calbiochem, La 

15 Jolla, CA) and pooled. The TFl (erythrocytic) , Jurkat (T 
cell), MRC5 (fibroblastic), JY ( lymphoblas toid B cell), 
and U937 (myelomonocytic ) cell lines were obtained from 
American Type Culture Collection (ATCC, Rockville, MD) . 
CHA is an epithelial kidney carcinoma cell line kindly 

20 provided by C. Bain (Centre Leon Berard, Lyon, France) . 
All cell lines were stimulated by PMA-ionomycin f or . 1 h 
and 6 h and pooled. Murine fibroblasts transfected with 
human CD40 ligand (CD40L L cells) were produced in the 
laboratory (Garrone, et al . (1995) J. Exp. Med. 182:12 65- 

25 1273. All cell types were cultured in RPM1 1640 (GIBCO 
BRL, Gaithersburg, MD) supplemented with 10% (vol/vol) 
heat-inactivated fetal bovine serum ( FBS ; Flow 
laboratories, Irvine, UK) , 10 mM Hepes, 2 mM L-glutamine , 
5 x 10 ~ 5 M 2 -mercaptoethanol , penicillin (100 U/ml) and 

3 0 streptomycin (100 |ig/ml; hereafter referred to as 
complete medium) . 

Generation of DC from CD3 4 + cells and from monocytes. 

Umbilical cord blood samples were obtained according 
to appropriate institutional guidelines . Isolation of 
3 5 CD34 + progenitors was achieved using Minimacs separation 
columns (Miltenyi Biotec GmbH) as described by Caux, et 
al. (1996) J . Exp . Med . 184:695-706. In all experiments, 
the isolated cells were 80 to 99% CD34 + as judged by 
staining with anti-CD34 mAb. Cultures of CD34+ cells 
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were established in the presence of SCF, GM-CSF, and TNFcx 
as described by Caux, et al . (1992) Nature 3 60:258-261; 
or Caux, et al . (1996) J. Exp. Med. 184:695-706. Cells 
collected after 6 days of culture were separated 
5 according to CDla and CD14 expression into CD14 + CDla" and 
CD14"CDla + using a FACStar* (Becton Dickinson, Mountain 
View, CA) as described (Caux, et al . (1996) J. Exp. Med. 
184:695-706). Cells were further cultured in presence of 
GM-CSF and TNFa until day 12-17, when 70-90% of cells are 

10 CDla + DC. Monocytes were purified by immunomagnetic 

depletion (Dynal) after preparation of PBMC followed by a 
52% Percoll gradient. The depletion was performed with 
anti-CD3, anti-CD19, and anti-CD8 ascites, and with 
purified anti-NKHl (Coulter) and anti-CD16 ( Immunotech) 

15 mAbs. Monocyte-derived dendritic cells were produced by 
culturing purified monocytes for 6 days in the presence 
of GM-CSF and IL-4 (Sallusto and Lanzavecchia (1994) 
Exp. Med: 179:1109-1118) . Cells were activated with LPS 
at the concentration of 25 ng/ml for 1 h to 72 h or with 

20 CD40L transfected L cells (one L cell for five DC) (Caux, 
et al. (1994) J . Exp. Med. 180:1263-1272). 

III. cDNA libraries and isolation of F06B09 cDNA clone 

Total RNA was isolated from PMA-ionomycin activated - 
CDla~CD14 + DC (at day 12 of the culture) and from 
activated CHA cell line. See Chomczynski and Sacchi 
(1987) Anal. Biochem. 162:156-159. RNA was treated with 
DNase I before mRNA purification using the Oligotex-dT 
kit (Qiagen GmbH, Hilden, Germany). PolyA+ RNA (2 fig) 
was used to make a cDNA library in the pSport vector 
(Superscript Plasmid System Kit, GIBCO BRL) . A , 
subtraction library was made using the method of Hara et 
al. (1994) Blood 84:189-199, with minor modifications. 
In this protocol, tracer (subtracted) cDNA was the CD14+- 
derived DC cDNA, and driver (subtractive) cDNA was CHA 
cDNA. A 0.6 Kb cDNA containing a polyA tail was isolated 
from the CD14+ -derived DC subtraction library. The cDNA 
of this gene was first amplified using the RACE 
MARATHON™ kit (Clontech, Palo Alto, CA) and two 
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J oligonucleotides : 5 ' CAGAAATGCCACGAAACAGCCAGGTACT (NGSP1 ; 
SEQ ID NO: 5) and 5 ' GCCCCAGTTGCTCATACAAACAGATCAG (GSP1; 
SEQ ID NO: 6) with a recommended cycling program 1. PCR 
products were cloned in the pCRII plasmid (Invitrogen, 
5 San Diego, CA) . A lambda CD34 + -derived DC cDNA library 
was constructed using the GREAT LENGTHS™ cDNA Synthesis 
kit with the A/TriplEx vector (Clontech) , and was next 

screened with a 5' F06B09 probe to obtain a full-length 
cDNA. Sequencing was performed on both strands by the 

10 dideoxynucleotide method using a Taq Dye Deoxy Terminator 
Cycle Sequencing kit (Applied Biosystems, Foster City, 
CA) and an automated sequencer (Applied Biosystems) . 

A clone encoding the human F06B09 protein is 
isolated from a natural human dendritic cell or other 

15 source, by one of many different possible methods. Given 
the sequences provided herein, PCR primers or 
hybridization probes are selected and/or constructed to 
isolate .a nucleic acid, e.g., genomic DNA segments or 
cDNA reverse transcripts . Appropriate cell sources 

20 include human tissues, e.g., dendritic cell libraries. 
Tissue distribution below also suggests source tissues. 
Genetic and polymorphic or allelic variants are isolated 
by screening a population of individuals. 

This clone was discovered via EST analysis of human 

25 dendritic cell subtraction cDNA library- The driver was 
CD34+ derived, CD14+, PMA, and ionomycin activated 
dendritic cell cDNA, while subtractor was PMA and 
ionomycin activated, kidney carcinoma cell line CHA. The 
initial poly-A containing EST was selected for its 

3 0 restricted distribution after Northern blot and semi- 
quantitative PCR analysis. 

A novel 0.6 Kb partial cDNA (F06B09) was isolated by 
screening a PMA-ionomycin activated CD34 + -derived DC 
subtraction library. Northern blots probed with the 
35 F06B09 clone showed a 3.7 Kb mRNA transcript 

predominantly expressed in dendritic cells. This mRNA 
• was absent by RT-PCR in CHA, the driver epithelial cell 
line used for subtraction. The 5' end of the original 
sequence was extended by RACE and by the screening of a 
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lambda DC cDNA library, to a final 3 691 bp cDNA. This 
cDNA contains a methionine codon located in a consensus 
Kozak sequence. See Kozak (1986) Cell. 44:283-292.^ The 
full-length cDNA (see Table 1) shows a 342 bp 5' 
5 untranslated sequence (nt 1-342), a 1689 bp open reading 
frame (nt 343-2031), a 3' untranslated sequence of 1660 
bp (nt 2032-3691) and a polyadenylation signal AATAAA at 
position 3 638-3 643 followed by a poly (A) tail. The 
encoded protein of 562 amino acids reveals a strong 
10 homology with membrane-type matrix metalloproteinase 

(MT-MMP) . As a member of the metalloproteinase family, 
F06B09 contains a propeptide domain with a cysteine- 
switch activation domain at Cys69 (Van Wart and Birkedal- 
Hansen (1990) Ptoc. Nat] - Acad . Sci . USA. 87:5578-5582) 
15 and the core enzyme domain contains three zinc -chelating 
histidine (H) residues at positions 212, 216 and 222 in 
the zinc binding motif. Like other MT-MMP members, 
F06B09 presents a consensus insertion RRRR between 
residues 82-86, corresponding to a furin cleavage site 
2 0 (Table 1) . The sequence is followed by a hinge region 
(2 60-290) and a potential transmembrane domain of 12 
amino acids (526-577) in a hemopexin-like domain 
(291-538) and a short intracytoplasmic domain (538-541) . 
Multiple alignment with members of the membrane-type 
25 matrix metalloproteinase (MT-MMP) family revealed the 

closest homology with the MT4-MMP (48%) (Puente, et al. 
(1996) r^rsr Res. 56:944-949), and 38%, 39% and 35% 
respectively with the MT1-MMP (Sato, et al . (1994) Nature 
37 0:61-65) the MT2-MMP (Will and Hinzmann (1995) Eur , J . 
30 Biochem. 231:602-608) and the MT3-MMP (Takino, et al . 

(1995) ,t gjgJL Chem. 270:23013-23020). Comparison of 
the most conserved domain, the catalytic domain, showed 
that F06B09 presents the highest homology to MT4-MMP 
(48%) (Puente, et al . (1996) Cancer Res. 56:944-949) and 
3 5 significant homologies to other members of the matrix 

membrane metalloproteinase family (MMP) , like the type IV 
collagenases MMP-9 and MMP-2 (Collier, et al . (1988) 
Biol. Chem. 263:6579-6587; Wilhelm, et al . (1989) 
Biol. Chem. 264:17213-17221) . 
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Therefore, it is ■ suggested that the F06B09 is a 
fifth member of the MT-MMP subgroup of the MMP. 

The coding sequence appears to be complete, encoding 
a 21 amino acid putative signal peptide followed by a 541 
5 residue polypeptide with significant homology to the 

membrane -type matrix metalloproteases MT-MMP1 to 4 . No 
evidence yet suggests alternative splicing of this 
message The limited EST distribution is indicative of a 
restricted expression pattern. 

10 Further clones will be isolated, e.g., using an 

antibody based selection procedure. Standard expression 
cloning methods are applied including, e.g., FACS 
staining of membrane associated expression product. The 
antibodies are used to identify clones producing a 

15 recognized protein. Alternatively, antibodies are used 
to purify an F06B09 protein, with protein sequencing and 
standard means to isolate a gene encoding that protein. 

Genomic or cDNA sequence based methods will also 
allow for identification of sequences naturally 

2 0 available, or otherwise, which exhibit homology to the 
provided sequences . 

IV. Isolation of mouse F06B09 

Similar methods are used as above to isolate an 
25 appropriate F06B09 protein gene. See, e.g., GenBank 
Accession numbers X91785, X83537; D63 579, and U54984. 
Similar source materials as indicated above are used to 
isolate natural genes, including genetic, polymorphic, 
allelic, or strain variants. Species variants are also 
30 isolated using similar methods. Various sequence 

databases may suggest related or counterpart sequences . 
See, e.g., Capone, et al . (1996) J . Immunol. 157:969-973. 

V. Isolation of an avian F06B09 protein clone 

35 An appropriate avian source is selected as above. 

Similar methods are utilized to isolate other species 
• variants, though the level of similarity will typically 
be lower for avian F0 6B09 protein as compared to a human 
to mouse sequence. 
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VI. Message distribution 

PCR based detection is performed by standard 
methods, preferably using appropriate primers from 
opposite ends of the coding sequence, but flanking 
segments might be selected for specific purposes.. 

Alternatively, hybridization probes are selected. 
Particular AT or GC contents of probes are selected 
depending upon the expected homology and mismatching 
expected. Appropriate stringency conditions are selected 
to balance an appropriate positive signal to background 
ratio. Successive washing steps are used to identify 
clones of greater homology. . . 

Total RNA (20 M-g), extracted from cell lines or cell 
15 preparations as described above, were fractionated by 
electrophoresis on a 1% agarose- formaldehyde gel and 
transferred onto positively charged nylon membrane 
(GeneScreenPlus, NEN Life Science Products, Boston, MA) 
as described by Thomas (1980) Pror Natl, ftcad , Sci . USA . 
77:5201-5205. After transfer, blots were cross-linked by 
UV light (Stratal inker, La Jolla, CA) . The original 
cloned 600 bp fragment was labeled by random priming with 
32p_ dC TP (3000 Ci/mmol, Amersham; Ready to Go, Pharmacia 
Biotech, Orsay, France) and unincorporated nucleotides 
25 were removed by spin column chromatography (Chromaspin- 
100, Clontech) . Hybridization and washes were performed 
in stringent conditions (0.1X SSC/0.1% SDS at 65° C) . X- 
ray films (Kodak, Rochester, NY) were exposed for 3 weeks 
at -80° C with intensifying screens. Multiple tissue 
normal fetal and adult organs Northern blots (purchased 
from Clontech) were similarly used according to the 
manufacturer ' s recommendations . 

For RT-PCR methods, sotal RNA extracted from 1 to 10 
x 10* cells (Chomczynski and Sacchi (1987) Anal . Biochem. 
162-156-159) were reverse transcribed using random 
hexamer primers (Pharmacia, Upsalla, Sweden) and the 
Superscript RNase-H reverse transcriptase (GIBCO BRL) . 
PCR was performed in a 100 ul volume using 5 ng cDNA, 10 
Hi 10X PCR reaction buffer (Perkin' Elmer Cetus, Norwalk, 
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,CT) , 2.5 U of Taq polymerase (Gene Amp PCR reagents kit: 
Perkin Elmer Cetus) and 200 mM dNTPs and 500 nM of the 5' 
and 3' amplification primers. The PCR reactions were 
made in a DNA thermal cycler (Perkin Elmer) for 35 cycles 
5 (1 min denaturation at 94° C, 1 min annealing at 60° C, 
and 2 min elongation at 72° C) . P actin RT-PCR was used 
as positive control for the efficiency of the reaction 
using sense and antisense primers. Appropriate sense and 
antisense primers were used to amplify F06B09. See Table 
10 1. 

Northern blot showed a single band of about 4.5 kb 
in non-activated and PMA and ionomyc in-activated, CD3 4+ 
derived human DC, and a weak signal in in vitro generated 
granulocytes. No signal was detected in TF1 , Jurkat, 

15 CHA, or JY cell' lines, nor in freshly isolated monocytes, 
activated T cells, resting and activated PBLs, or B 
cells. No expression was found in either fetal or adult 
tissues. PCR distribution analysis showed expression in 
activated DC and the MRC5 lung fibroblast cell line, as 

20 well as very low signal in U937. The original EST was 
extended by 5' RACE. A lambda DC cDNA library was 
screened with a 5' probe. An ORF was identified, which 
showed highest homology with the human MT4-MMP, a recent 
addition to the membrane type matrix me tallopro tease 

25 family. Positive signals were also detected in 
granulocytes . 

Northern blot analysis showed a single -4 Kb 
transcript predominantly expressed in resting 
CD3 4 + -derived DC, to a lesser extent in PMA-ionomycin 

3 0 activated DC, and weakly in granulocytes generated in 
vitro. The expression pattern of this novel gene was 
also analyzed by RT-PCR, on freshly isolated cells and on 
various cell lines. Similarly, RT-PCR analysis confirmed 
the higher level of F06B09 expression in resting and 

3 5 activated CD34 + -derived DC, in granulocytes and to a 
weaker extent in resting PBLs. F06B09 mRNA is also 
weakly present on the B cell line, JY. No messenger was 
detected in TF1 ( myelo- erythrocytic ) , CHA (carcinoma) , 
Jurkat (T cell) , MRC5 (fibroblastic) or U937 (myelo- 
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monocytic) cell lines, nor in freshly isolated monocytes, 
activated T and B cells or activated PBLs . Among normal 
human tissues tested, a significant band of -4 Kb was 
seen in spleen, lymph node, thymus, appendix, PBL and in 
5 bone marrow but absent in fetal tissues . An additional 6 
Kb band corresponding probably to unspecific expression 
or to a longer existing form, was also detected in 
spleen, PBL and bone marrow but absent in lymph node, 
thymus, and appendix. 
10 The cellular distribution of F06B09 was next 

determined by the extent of hybridization among the 
gel-fractionated population of cDNA inserts from 
libraries made from different cell types. Consistent 
with the above observations, F06B09 is present in both 
15 CD3 4 + -derived DC and in monocyte-derived DC, but also in 
effector T cells, including Thl and Th2 cells, and to a 
weaker extent in NK cells. F06B09 mRNA expression is 
down-regulated after PMA-ionomycin activation at once in 
DC and T cells. No signal was detected in monocytes, B 
20. cell lines nor in different fetal tissues. 

In conclusion, the novel MT-MMP appears to be mainly 
transcribed by resting DC and weakly by effector T cells. 

F0 6B09 mRNA is strongly expressed in different types 
of DC and down-regulated by CD40L . activation . Since the 
25 original F06B09 clone was identified in a DC library, 
.further characterization was performed by semi- 
quantitative RT-PCR. The expression of this gene was 
analyzed during DC differentiation and maturation, either 
in DC generated in vitro from CD34 + cord blood 
3 0 progenitors cultured with GM-CSF and TNFa or from 

monocytes cultured with GM-CSF and IL-4. During the 
culture of CD34+ human cord blood progenitors, F06B09 is 
first detected at day 6 and increases up to day 12 . This 
messenger is down-regulated after triggering final 
3 5 maturation of the DC by 4 days co-culture with 

hCD40L-transfected L cells. Similarly, while monocytes 
do not express detectable amount of F06B09 mRNA, a 
significant expression could be detected after 6 days of 
culture in the presence of GM-CSF and IL-4. In contrast, 
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^following activation of these monocyte-derived DC through 
CD40, the level of mRNA decreases rapidly within 3 h to 
12 h, A low amount of F06B09 mRNA is also found in Ih 
PMA-ionomycin activated CDla + and CD14 4 * DC subsets, which 
5 is down-regulated after 6h PMA-ionomycin activation. Day 
12 macrophages generated in vitro express also weakly 
F06B09, and 6 h PMA-ionomycin activation of these cells 
is enough to switch off the signal. In contrast, no 
signal is detected in freshly isolated Langerhans cells, 

10 in basal keratinocytes , in freshly isolated and CD40L 
activated B cells, and in anti-CD3 and anti-CD28 
activated T cells. 

Taken together, these results confirm that F06B09 
mRNA is expressed in different DC subtypes and rapidly 

15 down-regulated upon DC maturation. 

VII. Chromosomal localization.. 

The full-length cDNA sequence of F06B09 was analyzed 
against the EMBL. nucleotide and EST databases and 
20 resulted in identification of a 436 bp EST (W72721) , 
matching exactly with the F06B09 sequence. 

Comparison of the full-length cDNA sequence of 
F06B09 with the EMBL nucleotide and EST databases 
identifies a 436 bp EST (W72721) , corresponding exactly 
25 to the F0 6B09 sequence. This EST is located on 

chromosome 16pl3.3. In contrast,' MT1-MMP and MT3-MMP 
have been previously located on chromosome 14qll-12 and 
8q21. 3-22.1 respectively (Mattei, et al . (1997) Genomics. 
40:168-169.; Mignon, et al . (1995) Genomics. 28:360-361). 
3 0 Of note, the novel F06B09 MT-MMP gene is on the same 

chromosome than MT2-MMP, but both genes are located on a 
different loci; MT2-MMP is on chromosome 16ql2 (Mattei, 
et al. (1997) Genomics. 40:168-169; Yasumitsu, et al . 
(1997) DNA Res . 4:77-79) whereas the novel MT-MMP is on 
35 chromosome 16pl3 . 3 . 



.VIII. Expression; purification; characterization 

With an appropriate clone from above, the coding 
sequence is inserted into an appropriate expression 
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yector. This may be in a vector specifically selected 
for a prokaryote, yeast, insect, or higher vertebrate, 
e.g., mammalian expression system. Standard methods are 
applied to produce the gene product, preferably as a 
5 soluble secreted molecule, but will, in certain 

instances, also be made as an intracellular protein. 
Intracellular proteins typically require cell lysis to 
recover the protein, and insoluble inclusion bodies are a 
common starting material for further purification. 

10 With a clone encoding a vertebrate F06B09 protein, 

recombinant production means are used, although natural 
forms may be purified from appropriate sources, e.g., 
expressing cell lines. The protein product is purified 
by standard methods of protein purification, in certain 

15 cases, e.g., coupled with immunoaf f inity methods. 

Immunoaf f inity methods are used either as a purification ' 
step, as described above, or as a detection assay to 
determine the partition properties of the protein. 

Preferably, the protein is secreted into the medium, 

20 and the soluble product is purified from the medium in a 
soluble form. Standard purification techniques applied 
to soluble proteins are then applied, with enzyme assays 
or immunodetection methods useful for following where the 
protease purifies. Alternatively, as described above, 

25 inclusion bodies from prokaryotic expression systems are 
a useful source of material. Typically, the insoluble 
protein is solubilized from the inclusion bodies and 
refolded using standard methods. Purification methods 
are developed as described above. 

3 0 In certain embodiments, the protein is made in a 

eukaryotic cell which glycosylates the protein normally. 
The purification methods may be affected thereby, as may 
biological activities . The intact protein can be 
processed to release the protein domain, probably due to 

3 5 a cleavage event. While recombinant protein appears to 
be processed, the physiological processes which normally 
•do such in native cells remain to be determined. 

The product of the purification method described 
above is characterized to determine many structural 
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features. Standard physical methods are applied, e.g., 
amino acid analysis and protein sequencing. The 
resulting protein is subjected to CD spectroscopy and 
other spectroscopic methods, e.g., NMR, ESR, mass 
5 spectroscopy, etc. The product is characterized to 

determine its molecular form and size, e.g., using gel 
chromatography and similar techniques. Understanding of 
the chromatographic properties will lead to more gentle 
or efficient purification methods . 
10 Prediction of glycosylation sites may be made, e.g., 

as reported in Hansen, et al . (1995) Biochem. J . 308:801- 
813. 

IX. Preparation of antibodies against vertebrate F06B09 
15 protein 

With protein produced and purified, as above, 
animals are immunized to produce antibodies. Polyclonal 
antiserum may be raised using non-purified antigen, 
though the resulting serum will exhibit higher background 
20 levels. Preferably, the antigen is purified using 

standard protein purification techniques, including, 
e.g., affinity chromatography using polyclonal serum 
indicated above. Presence of specific antibodies is 
detected using defined synthetic peptide fragments. 
25 Alternatively, polyclonal serum is raised against a 

purified antigen, purified as indicated above, or using 
synthetic peptides. A series of overlapping synthetic 
peptides which encompass all of the full length sequence, 
if presented to an animal, will produce serum recognizing 
3 0 most linear epitopes on the protein-. Such an antiserum 
is used to affinity purify protein. This purified 
protein, in turn, may be used to immunize another animal 
to produce another antiserum preparation. 

Standard techniques are used' to generate induce 
3 5 monoclonal antibodies to either unpurified antigen, or, 
preferably, purified antigen. 

X. Structure activity relationship 
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Information on the .criticality of particular 
residues is determined using standard procedures and 
analysis. Standard mutagenesis analysis is performed, 
e.g., by generating many different variants at determined 
5 positions, e.g., at the positions identified above, and 
evaluating biological activities of the variants.. This 
may be performed to the extent of determining positions 
which modify activity, or to focus on specific positions 
to determine the residues which can be substituted to 

10 either retain, block, or modulate biological activity. 

Alternatively, analysis of natural variants can 
indicate what positions tolerate natural mutations. This 
may result from populational analysis of variation among 
individuals, or across strains or species. Samples from 

15 selected individuals are analyzed, e.g., by PCR analysis 
and sequencing. This allows evaluation of population 
polymorphisms . 

All references cited herein are incorporated herein 
by reference to the same extent as if each individual 
publication or patent application was specifically and 
individually indicated to be incorporated by reference in 
its entirety for all purposes. 

Many modifications and variations of this invention 
can be made without departing from its spirit and scope, 
as will be apparent to those skilled in the art. The 
specific embodiments described herein are offered by way 
of example only, and the invention is to be limited only 
by the terms of the appended claims, along with the full 
scope of equivalents to which such claims are entitled. 
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1. A substantially pure or recombinant F06B09 
polypeptide -exhibiting at least about 80% sequence 

5 identity over a length of at least about 100 amino acid 
residues to SEQ ID NO: 4. 

2. A fusion protein comprising a polypeptide of claim 
1. 

10 

3 . A binding compound which specifically binds to a 
polypeptide of claim 1. 

4. The binding compound of claim 3 which is an antibody 
15 or antibody fragment. 

5. A nucleic acid encoding a polypeptide of claim 1. 

6. An expression vector comprising the nucleic acid of 
20 claim 5. 

7. A host cell comprising the vector of claim 6. 

8. A process for recombinatly producing a polypeptide 
25 comprising culturing the host cell of claim 7 under 

conditions in which the polypeptide is expressed. 
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SEQUENCE LISTING 



SEQ 


ID 


NO: 


1 


is 


human F06B-09 nucleotide sequence. 


SEQ 


ID 


NO: 


2 


is 


human F06B09 amino acid sequence. 


SEQ 


ID 


NO: 


3 


is 


second human F6B09 nucleotide sequence. 


SEQ 


ID 


NO: 


4 


is 


second human F06B09 amio acid sequence. 


SEQ 


ID 


NO: 


5 


is 


sense primer. 


SEQ 


ID 


NO: 


6 


is 


ant is ens e primer 


SEQ 


ID 


NO: 


7 


is 


primate MT4-MMP amino acid sequence. 


SEQ 


ID 


NO: 


8 


is 


primate MT2 -MMP amino acid sequence. 


SEQ 


ID 


NO: 


9 


is 


primate MT1-MMP amino acid sequence. 


SEQ 


ID 


NO: 


10 


is primate MT3-MMP amino acid sequence. 



15 (1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Schering Corporation 

(B> STREET: 2000 Galloping Hill Road 

(C) CITY: Kenilworth 

20 (D) STATE: New Jersey 

<E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) 07033 

(G) TELEPHONE (908) 298-5056 

(H) TELEFAX: (908) 298-5388 
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(ii) TITLE OF INVENTION: MAMMALIAN PROTEINASES; 
RELATED REAGENTS AND METHODS 

(iii) NUMBER OF SEQUENCES: 10 



(iv) COMPUER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: Power Macintosh 

(C) OPERATING -SYSTEM: Mac OS 8.1 
35 (D) SOFTWARE: Microsoft Word 6.0.1 

(v) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 
40 (C) CLASSIFICATION 



(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: U.S. 09/005,263 

(B) FILING DATE: 09-JAN-199 8 



45 

(2) INFORMATION FOR SEQ ID NO:l: 



(i) SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 3695 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

( D ) TOPOLOGY : 1 inear 



55 (ii) MOLECULE TYPE: cDNA 
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(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 344.. 2032 

5 (ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 398.. 2032 

(ix) FEATURE: 
10 (A) NAME / KEY : misc.feature 

(B) LOCATION: 3458 

(D) OTHER INFORMATION: /note= "nucleotide 3458 designated 
A, may be A or T" 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

CATGCAACAT AATC TTGCTC GATTCTAAAG TCAACGGATC CTGCAAAATT CGCGGCCGCG 60 

20 TCAACCCATT AGGTCTTGGC CTTGGAATAA AATTGCTTCT CGTCTGATTC CCGGGCCCAC 12 0 

CCGACCCAGC GGCGCAACCC TGGCCCTCCG GGACCCTCCG CTGACTCCAC CGCGCACTTC 180 

CCGGGACCCC CACACACATC CCAGCCCTCC GGCCGATCCC TCCCTACTCG GTGCCGGGTG 240 

CCCCCCTTTT TTTTCTAGGC CCGGATCTCC TCCCCCAGGT CCCCGGGGCG GCCCCAACCA 3 00 

GGCCCCCTTC AAACCCCGCC GGCGGCCCGG GCTGGGGCGC ACC ATG CGG CTG CGG 355 

Met Arg Leu Arg 
-18 ' -15 

CTC CGG CTT CTG GCG CTG CTG CTT CTG CAT GCT GGC ACC GCC CGC GCG 403 
Leu Arg Leu Leu Ala Leu Leu Leu Leu His Ala Gly Thr Ala Arg Ala 
-10 -5 1 

CGC CCC GAA GCC CTC GGC GCA GGA CTT AGC CTG GGC TGT GAG AAC TGG 451 
Arg Pro Glu Ala Leu Gly Ala Gly Leu Ser Leu Gly Cys Glu Asn Trp 
5 10 15 
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40 CTG ACT- CGC TAT GGT TAC CTA CCG CCA CCC GAC CCT GCC CAG GCC CAG 499 
Leu Thr Arg Tyr Gly Tyr Leu Pro Pro Pro Asp Pro Ala Gin Ala Gin 
20 ~ ~ 25 30 

CTG CAG AGC CCT GAA AAT TTG CGC GAT GCC ATC AAA GTC ATG CAA AGG 547 
45 Leu Gin Ser Pro Glu Asn Leu Arg Asp Ala lie Lys Val Met Gin Arg 
35 40 45 50 

TTC GCG GGG CTG CCG GAG ACC GGC CGC ATG GAC CCA GGG ACA GTG GCC 595 
Phe Ala Gly Leu Pro Glu Thr Gly Arg Met Asp Pro Gly Thr Val Ala 
50 55 60 65 

ACC ATG CGT AAG CCC CGC TGC TCC CTG CCT GAC GTG CTG GGG GTG GCG 643 
Thr Met Arg Lys Pro Arg Cys Ser Leu Pro Asp Val Leu Gly Val Ala 
70 ~ 75 80 

55 GGG CTG GTC AGG CGG CGT CGC CGG TAC GGT CTG AGC GGC AGC GTG TGG 691 
Gly Leu Val Arg Arg Arg Arg Arg Tyr Gly Leu Ser Gly Ser Val Trp 
85 90 , 95 

'60 GAG AAG CGA ACC GTG ACA TGG AGG GTA CGT TCC TTC CCC CAG AGC TCC 73 9 

Glu Lys Arg Thr Val. Thr Trp Arg Val Arg Ser Phe Pro Gin Ser Ser 
100 105 . HO 

CAG GTG AGC CAG GAG ACC GTG CGG GTC CTC GTG AGC TAT GCC CTG ATG 7 87 
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Gin Val Ser Gin Glu Thr Val Arg Val Leu Val Ser Tyr Ala Leu Met 
115 120 125 130 

GCG TGG GGC ATG GAG TCA GGC CTC ACA TTT CAT GAG GTG GAT TCC CCC 83 5 

5 Ala Trp Gly Met Glu Ser Gly Leu Thr Phe His Glu Val Asp Ser Pro 

135 140 145 

CAG GGC CAG GAG CCC GAC ATC CTC ATA GAC TTT GCC CGC GCC TTC CAA 8 83 

Gin Gly Gin Glu Pro Asp lie Leu lie Asp Phe Ala Arg Ala Phe Gin 
10 150 155 160 

CAG GAC AGC TAC CCC TTC. GAC GGG TTG GGG GGC ACC CTA GCC CAT GCC 931 

Gin Asp Ser Tyr Pro Phe Asp Gly Leu Gly Gly Thr Leu Ala His Ala 
165 170 175 

15 

TTC TTC CCT GGG GAG CAC CCC ATC TCC GGG GAC ACT CAC TTT GAC GAT 979 

Phe Phe Pro Gly Glu His Pro lie Ser Gly Asp Thr His Phe Asp Asp 
180 185 190 

2 0 GAG GAG ACC TGG ACT . TTT GGG TCA AAA GAC GGC GAG GGG ACC GAC CTG 1027 
Glu Glu Thr Trp Thr Phe Gly Ser Lys Asp Gly Glu Gly Thr Asp Leu 
195 . 200 205 " 210 

TTT GCC GTG GCT GTC CAT GAG TTT GGC CAC GCC CTG GGC ATG GGC CAC 107 5 

25 Phe Ala Val Ala Val His Glu Phe Gly His Ala Leu Gly Met Gly His 

215 220 225 

TCC TCA GCC CCC GAC TCC ATT ATG AGG CCC TTC TAC CAG GGT CCG GTG 1123 
Ser Ser Ala -Pro Asp Ser lie Met Arg Pro Phe Tyr Gin Gly Pro Val 
30 230 235 240 

GGC GAC CCT GAC AAG TAC CGC CTG TCT CTG GAT GAC CGC GAT GGC CTG 1171 

Gly Asp Pro Asp Lys Tyr Arg Leu Ser Leu Asp Asp Arg Asp Gly Leu 

245 250 255 

35 

CAG CAA CTC TAT GGG AAG GCG CCC CAA ACC CCA TAT GAC AAG CCC ACA 1219 

Gin Gin Leu Tyr Gly Lys Ala Pro Gin Thr Pro Tyr Asp Lys Pro Thr 

260 265 270 

40 AGG AAA CCC CTG GCT CCT CCG CCC CAG CCC CCG GCC TCG CCC ACA CAC 12 67 

Arg Lys Pro Leu Ala Pro Pro Pro Gin Pro Pro Ala Ser Pro Thr His 
275 280 285 290 

AGC CCA TCC TTC CCC ATC CCT GAT CGA TGT GAG GGC AAT TTT GAC GCC 1315 
45 Ser Pro Ser Phe Pro lie Pro Asp Arg Cys Glu Gly Asn Phe Asp Ala 

295 300 305 

ATC GCC AAC ATC CGA GGG GAA ACT TTC TTC TTC AAA GGC CCC TGG TTC 13 6 3 

lie Ala Asn lie Arg Gly Glu Thr Phe Phe Phe Lys Gly Pro Trp Phe 
50 310 315 320 

TGG CGC CTC CAG CCC TCC GGA CAG CTG GTG TCC CCG CGA CCC GCA CGG 1411 

Trp Arg Leu Gin Pro Ser Gly Gin Leu Val Ser Pro Arg Pro Ala Arg 
325 330 335 

55 

CTG CAC CGC TTC TGG GAG GGG CTG CCC GCC CAG GTG AGG GTG GTG CAG 1459 

Leu His Arg Phe Trp Glu Gly Leu Pro Ala Gin Val Arg Val Val Gin 
340 345 350 

60 GCC GCC TAT GCT CGG CAC CGA GAC GGC CGA ATC CTC CTC TTT AGC GGG 1507 
Ala Ala Tyr Ala Arg. His Arg Asp Gly Arg lie Leu Leu Phe Ser Gly 
355 360 365 370 



CCC CAG TTC TGG GTG TTC CAG GAC CGG CAG CTG GAG GGC GGG GCG CGG 
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Pro Gin Phe Trp Val Phe Gin Asp Arg Gin Leu Glu Gly Gly Ala Arg 
375 380 385 

CCG CTC ACG GAG CTG GGG CTG CCC CCG GGA GAG GAG GTG GAC GCC GTG 16 03 

5 Pro Leu Thr Glu Leu Gly Leu Pro Pro Gly Glu Glu Val Asp Ala Val 
390 395 400- 

TTC TCG TGG CCA CAG AAC GGG' AAG ACC TAC CTG GTC CGC GGC CGG CAG 16 51 

Phe Ser Trp Pro Gin Asn Gly Lys Thr Tyr Leu Val Arg Gly Arg Gin 
10 405 410 415 

TAC TGG CGC TAC GAC GAG GCG GCG GCG CGC CCG GAC CCC GGC TAC CTT 169 9 

Tyr Trp Arg Tyr Asp Glu Ala Ala Ala Arg Pro Asp Pro Gly Tyr Leu 
420 ~ " 425 430 

15 

CGC GAC CTG AGC CTC TGG GAA.GGC GCG CCC CCC TCC CCT GAC GAT GTC 17 47 , 

Arg Asp Leu Ser Leu Trp Glu. Gly Ala Pro Pro Ser Pro Asp Asp Val 
435 440 445 450 

20 ACC GTC AGC AAC GCA GGT GAC ACC TAC TTC TTC AAG GGC GCC. CAC TAC 17 9 5 

Thr Val Ser Asn Ala Gly Asp Thr Tyr Phe Phe Lys Gly Ala His Tyr 
455 460 465 

TGG CGC TTC CCC AAG AAC AGC ATC AAG ACC GAG CCG GAC GCC CCC CAG 18 43 

25 Trp Arg Phe Pro Lys Asn Ser lie Lys Thr Glu Pro Asp Ala Pro Gin 

470 475 480 

CCC ATG GGG CCC AAC TGG CTG GAC TGC CCC GCC CCG AGC TCT GGT CCC 18 91 

Pro Met Gly Pro Asn Trp Leu Asp Cys Pro Ala Pro Ser Ser Gly Pro 
30 485" 490 495 

CGC GCC CCC AGG CCC CCC AAA GGG ACC CCC GTG TCC GAA ACC TGC GAT 193 9 

Arg Ala Pro Arg Pro Pro Lys Gly Thr Pro Val Ser Glu Thr Cys Asp 
500 505 510 

TGT CAG TGC GAG CTC AAC CAG GCC GCA GGA CGT TGG CCT GCT CCC ATC 19 87 

Cvs Gin Cys Glu Leu Asn Gin Ala Ala Gly Arg Trp Pro Ala Pro lie 
515 520 525 530 

CCG CTG CTC CTC TTG CCC CTG CTG GTG GGG GGT GTA GCC TCC CGC 2032 
Pro Leu Leu Leu Leu Pro Leu Leu Val Gly Gly Val Ala Ser Arg 
535 540 545 



35 



40 



45 


TGATGGGGGG 


AGC CATC CAG 


AC CGAAC AGC 


GCCCTCCACG 


GCCGAGTCCC 


CCGCCGCTGG 


2092 


ACCTGGTCGG 


GGGTTGTGAG 


GCGCTGCGGA 


GGCCCCTTGT 


CTGTTCCCAC 


GGACGGGGGC 


2152 




TCGGGCGCGG 


ACTAAGCAGG 


GGGGATCTCC 


CGCGCAGGGG 


CGGCGGCGGC 


GGGGAC CGGT 


2212 


50 


CGCCTGGCGC 


TGGGCTCAGT 


CTCCTCAGGG 


TCTGAGACCC 


CGGCGCTGCC 


ACCGGAACCC 


2272 




GCCTTCAGGG 


GCGCACGCGC 


GCTGGGACCA 


TGCGTCGGTC 


GTCGCCCCCG 


TCGTTCCCTC 


2332 


55 


CCGGCTGCCG 


CCAGGGGGCG 


GTCGGACCCC 


GCCTCCCGAG 


CCCGGGGAGG 


GGCGGGGAGG 


2392 


ACAAGGGGCG 


GGCCCGCGGC 


CTCACCCGGA 


GGGACGGCAG 


CCCCGGTCGC 


GCGCTGGCCC 


2452 




CGCAGGACCT 


TCCTTTTCCA 


GGAAGAGCCA 


GCTTTTCTCG 


GAGCGCAGTC 


CTGGGACTCT 


2512 


"60 


CCGCAGCCCC 


GCCCCGCCTG 


GCCACTGCGT 


CTGGCATTCC 


TGGGTCGTTA 


GAGGACAGGC 


2572 




CTGACTGCGA 


AGCTGTGCCT 


TGCCCCTCTC 


CCACCCGCAG 


TTTCTCACCC 


CGTTCTGCTC 


2632 




CCACAAGGCC 


CCCCTACAGT 


CACTGCCACA 


CTGGTGGGGA 


CCTGGGACCC 


AG AC CCGGAA 


2692 
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10 



15 



20 



25 



30 



35 



45 



50 



CCAGCCCAGA 


TATCACCCCT 


GAGGACCCAT 


GCGCCACGTC 


CTGGGTGGTG 


GAATCAGTGG 


2752 


GTGGAGGGAC 


GACCCTTGCT 


CTCCAGGCTG 


TTAACCTTTT 


CCGTTGCTCC 


CCCGCCACCC 


2812 


ACCTCCTCCT 


CCCCAGGCCA 


CCCAACTTGG 


GCACCTCCCT 


GGGCCCAGAA 


CTGCCTTCCA 


2872 


TTCAATGGGG 


AACCCTTCTA 


TCCCCAAGAA 


CCCCTTCCCT 


GCTTGCACCC 


TGGAGAGAAC 


2932 


AGCTTGACTC 


CCATCAACTC 


AACGCTGGTG 


GAAAGACAGG 


GACCGAACCC 


TGGCTCAGGC 


2992 


CTGGTCATTG 


CCTCCTCAGC 


ACTCCCTCCT 


GGGAGGCCTT 


AGCTCTAGAG 


TGAGGGGTGG 


3052 


GTGGAACCTG 


GGGGCACCTC 


GTTCACCCTG 


TCCCCACTCC 


CCACAGTTTT 


AGGATCTAAA 


3112 


TGATTGCCTC 


TGGAACTATT 


CTTC TAG AC T 


ATCCCACATC 


AGAATCACTG 


GGAAATTTAA 


3172 


GTTTGCAGAT 


CCCACACTCA 


CCCTGAATCC 


TCACTCAGGG 


TGGGGTCAGG 


AATCTGCATT 


3232 


TTAACTAGTC 


GCGGGGATTG 


TGGGGGGCAG 


TAGCTGGCTG 


TTTCGTGGCA 


TTTC TGTGGC 


3292 


TCTGCAGTGT 


TCCTCCACCC 


CAGGACCAAT 


ATGTTCAGGC 


CACACCGATG 


GCCTGAACCC 


3352 


CATGGGTAGA 


GTCACTTAGG 


GGCCACTTCC 


TAAGTTGCTG 


TCCAGCCTCA 


GTGACCCCCT 


3412 


AGTGCTTCCT 


GGAGCTGAGG 


CTGTGGGCGG 


CTGTCCCAbL 


AAL L.AAvjL.VaA 


GGGvjI iGCL-L. 


j4 / / 


CAGTTGCTCA 


TACAAACAGA 


TCAGCATGAG 


GACAGAAGGC 


AGGAGACTTT 


GGTCAGTTAC 


3532 


CTGGGAATTC 


TGGGCTGCCA 


GGAAAC G ATT 


TGGGCCTCTG 


TCAGTTTCTT 


TTCCATGTAT 


3592 


GAGGAGGGGG 


AAATTTGTAT 


ATTAGATACT 


TATTCATCCC 


ACTCTGGACA 


ATAAAAACGA 


3652 


ATGTACAAAA 


AAAACATAAA 


AAAAAAAAAT 


AAAGAAAATC 


AAA 




3695 



(2) INFORMATION FOR SEQ ID NO : 2 : 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 563 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Arg Leu Arg Leu Arg Leu Leu Ala Leu Leu Leu Leu His Ala Gly 
-18 ~ -15 -10 -5 

Thr Ala Arg Ala Arg Pro Glu Ala Leu Gly Ala Gly Leu Ser Leu Gly 
15 10 



Cys Glu Asn Trp Leu Thr Arg Tyr Gly Tyr Leu Pro Pro Pro Asp Pro 
55 15 20 25 30 

Ala Gin Ala Gin Leu Gin Ser Pro Glu Asn Leu Arg Asp Ala lie Lys 
35 40 45 

60 Val Met Gin Arg Phe Ala G^y Leu Pro Glu Thr Gly Arg Met Asp Pro 

50 55 60 

Gly Thr Val Ala Thr Met Arg Lys Pro Arg Cys Ser Leu Pro Asp Val 
65 70 75 
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Leu Gly Val Ala Gly Leu Val Arg Arg Arg Arg Arg Tyr Gly Leu" .Ser 
80 85 90 

Gly Ser Val Trp Glu Lys Arg. Thr Val Thr Trp Arg Val Arg Ser Phe 
95 100 105 110 

Pro Gin Ser Ser Gin Val Ser Gin Glu Thr Val Arg Val Leu Val Ser 
115 120 125 

Tyr Ala Leu Met Ala Trp Gly Met Glu Ser Gly Leu Thr Phe His Glu 
130 135 140 



Val Asp Ser Pro Gin Gly Gin Glu Pro Asp lie Leu lie Asp Phe Ala 

15 145 150 155 

Arg Ala Phe Gin Gin Asp Ser Tyr Pro Phe Asp Gly Leu Gly Gly Thr 

160 165 170 

20 Leu Ala His Ala Phe Phe Pro Gly Glu His Pro lie Ser Gly Asp Thr 

.175 180 185 190 



25 



His Phe Asp Asp Glu Glu Thr Trp Thr Phe Gly Ser Lys Asp Gly Glu 
195 200 205 

Gly Thr Asp Leu Phe Ala Val Ala Val His Glu Phe Gly His Ala Leu 
210 215 220 



Gly Met Gly His Ser Ser Ala Pro Asp Ser He Met Arg Pro Phe Tyr 
3 0 ' 225 ' 230 235 

Gin Gly Pro Val Gly Asp Pro Asp Lys Tyr Arg Leu Ser Leu Asp Asp 
240 245 250 

3 5 Arg Asp Gly Leu Gin Gin Leu Tyr Gly Lys Ala Pro Gin Thr Pro Tyr 
255 ^ 260 265 270 



40 



Asp Lys Pro Thr Arg Lys Pro Leu Ala Pro Pro Pro Gin Pro Pro Ala 
275 280 285 

Ser Pro Thr His Ser Pro Ser Phe Pro He Pro Asp Arg Cys Glu Gly 
290 295 300 



Asn Phe Asp Ala He Ala Asn He Arg Gly Glu Thr Phe Phe Phe Lys 
45 305 310 315 

Gly Pro Trp Phe Trp Arg Leu Gin Pro Ser Gly Gin Leu Val Ser Pro 
320 325 330 

50 Arg Pro Ala Arg Leu His Arg Phe Trp Glu Gly Leu Pro Ala Gin Val 
335 340 345 350 



55 



Arg Val Val Gin Ala Ala Tyr Ala Arg His Arg Asp Gly Arg He Leu 

355 360 365 

Leu Phe Ser Gly Pro Gin Phe Trp Val Phe Gin Asp Arg Gin Leu Glu 

370 375 380 



Gly Gly Ala Arg Pro Leu Thr Glu Leu Gly Leu Pro Pro Gly Glu Glu 
60 " 385 "* 390 395 

Val Asp Ala Val Phe Ser Trp Pro Gin Asn Gly Lys Thr Tyr Leu Val 
400 405 410 
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Arg Gly Arg Gin Tyr Trp Arg Tyr Asp Glu Ala Ala Ala Arg Pro Asp 
415 420 425 430 

Pro Gly Tyr Leu Arg Asp Leu Ser Leu Trp Glu Gly Ala Pro Pro Ser 
5 435 440 445 

Pro Asp Asp Val Thr Val Ser Asn Ala Gly Asp Thr Tyr Phe Phe Lys 
450 455 460 

_10 Gly Ala His Tyr Trp Arg Phe Pro Lys Asn Ser lie Lys Thr Glu Pro 
465 470 475 



15 



35 



55 



Asp Ala Pro Gin Pro Met Gly Pro Asn Trp Leu Asp Cys Pro Ala Pro 

480 485 490 

Ser Ser Gly Pro Arg Ala Pro Arg Pro Pro Lys Gly Thr Pro Val Ser 

495 500 505 510 



Glu Thr Cys Asp Cys Gin Cys Glu Leu Asn Gin Ala Ala Gly Arg Trp 
20 515 520 525 

Pro Ala Pro lie Pro Leu Leu Leu Leu Pro Leu Leu Val Gly Gly Val 
530 535 540 

25 Ala Ser Arg 
545 

(2) INFORMATION FOR SEQ ID NO : 3 : 

3 0 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3691 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 
40 (A) NAME / KEY : CDS 

(B) LOCATION: 343.. 2028 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 
45 (B) LOCATION: 406.. 2028 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION: 3454 

50 (D) OTHER INFORMATION: /note= "nucleotide 3454 is W, 

designated C herein" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
CATGCAACAT AATCTTGCTC GATTCTAAAG TCAACGGATC CTGCAAAATT CGCGGCCGCG 60 

TCAACCCATT AGGTCTTGGC CTTGGAATAA AATTGCTTCT CGTCTGATTC CCGGGCCCAC 12 0 

6 0 CCGACCCAGC GGCGCAACCC TGGCCCTCCG GGACCCTCCG . CTGACTCCAC CGCGCACTTC 18 0 

CCGGGACCCC CACACACATC CCAGCCCTCC GGCCGATCCC TCCCTACTCG GTGCCGGGTG 2 40 

CCCCCCGCCC TCTCCAGGCC CGGATCTCCT CCCCCAGGTC CCCGGGGCGG CCCCAGCCAG 3 00 
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GCCCCCTTCG AACCCCGCCG GCGGCCCGGG CTGGGGCGCA CC ATG CGG CTG CGG 354 

Met Arg Leu Arg 
-21 -20 

5 

CTC CGG CTT CTG GCG CTG CTG CTT CTG CTG CTG GCA CCG CCC GCG CGC 402 
Leu Arg Leu Leu Ala Leu Leu Leu Leu Leu Leu Ala Pro Pro Ala Arg 
-15 -10 "5 

10 GCC CCG AAG CCC TCG GCG CAG GAC GTG AGC CTG GGC GTG GAC TGG CTG 450 
Ala Pro Lys Pro Ser Ala Gin Asp Val Ser Leu Gly Val Asp Trp Leu 
15 10 15 

ACT CGC TAT GGT TAC CTG CCG CCA CCC CAC CCT GCC CAG GCC CAG CTG 498 
15 Thr Arg Tyr Gly Tyr Leu Pro Pro Pro His Pro Ala Gin Ala Gin Leu 

20 25 30 

CAG AGC CCT GAG AAG TTG CGC GAT GCC ATC AAA GTC ATG CAG AGG TTC 546 
Gin Ser Pro Glu Lys Leu Arg Asp Ala lie Lys Val Met Gin Arg Phe 
20 35 40 45 

GCG GGG CTG CCG GAG ACC GGC CGC ATG GAC CCA GGG ACA GTG GCC ACC 594 
Ala Gly Leu Pro Glu Thr Gly Arg Met Asp Pro Gly Thr Val Ala Thr 
50 55 60 



25 



45 



ATG CGT AAG CCC CGC TGC TCC CTG CCT GAC GTG CTG GGG GTG GCG GGG 642 
Met Arg Lys Pro Arg Cys Ser Leu Pro Asp Val Leu Gly Val Ala Gly 
65 70 75 



3 0 CTG GTC AGG- CGG CGT CGC CGG TAC GCT CTG AGC GGC AGC GTG TGG AAG 690 
Leu Val Arg Arg Arg Arg Arg Tyr Ala Leu Ser Gly Ser Val Trp Lys 
80 85 90 95 

AAG CGA ACC CTG ACA TGG AGG GTA CGT TCC TTC CCC CAG AGC TCC CAG 738 
3 5 Lys Arg Thr Leu Thr Trp Arg Val Arg Ser Phe Pro Gin Ser Ser Gin 

100 105 HO 

CTG AGC CAG GAG ACC GTG CGG GTC CTC ATG AGC TAT GCC CTG ATG GCC 786 
Leu Ser Gin Glu Thr Val Arg Val Leu Met Ser Tyr Ala Leu Met Ala 
40 115 120 125 

TGG GGC ATG GAG TCA GGC CTC ACA TTT CAT GAG GTG GAT TCC CCC CAG 834 
Tro Gly Met Glu Ser Gly Leu Thr Phe His Glu Val Asp Ser Pro Gin 
130 " 135 140 



GGC CAG GAG CCC GAC ATC CTC ATC GAC TTT GCC CGC GCC TTC CAC CAG 
Gly Gin Glu Pro Asp He Leu He Asp Phe Ala Arg Ala Phe His Gin 
145 150 155 



882 



50 GAC AGC TAC CCC TTC GAC GGG TTG GGG GGC ACC CTA GCC CAT GCC TTC 9 30 

Asp Ser Tyr Pro Phe Asp Gly Leu Gly Gly Thr Leu Ala His Ala Phe 
160 165 170 175 

TTC CCT GGG GAG CAC CCC ATC TCC GGG GAC ACT CAC TTT GAC GAT GAG 97 8 

55 Phe Pro Gly Glu His Pro He Ser Gly Asp Thr His Phe Asp Asp Glu 

180 185 190 

GAG ACC TGG ACT TTT GGG TCA AAA GAC GGC GAG GGG ACC GAC CTG TTT 1026 
Glu Thr Trp Thr Phe Gly Ser Lys Asp Gly Glu Gly Thr Asp Leu Phe 
- 60 195 200 205 

GCC GTG GCT GTC CAT GAG TTT GGC CAC GCC CTG GGC CTG GGC CAC TCC 1074 
Ala Val Ala Val His Glu Phe Gly His Ala Leu Gly Leu. Gly His Ser 
210 215 220 
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TCA GCC CCC AAC TCC ATT ATG AGG CCC TTC TAC CAG GGT CCG GTG GGC 1122 

Ser Ala Pro Asn Ser lie Met Arg Pro Phe Tyr Gin Gly Pro Val Gly 

225 230 235 

5 

GAC CCT GAC AAG TAC CGC CTG TCT CAG GAT GAC CGC GAT GGC CTG CAG 117 0 

Asp Pro Asp Lys Tyr Arg Leu Ser Gin Asp Asp Arg Asp Gly Leu Gin 
240 245 250 ~ 255 

10 CAA CTC TAT GGG AAG GCG CCC CAA ACC CCA TAT GAC AAG CCC ACA AGG 1218 
Gin Leu Tyr Gly Lys Ala Pro Gin Thr Pro Tyr Asp Lys Pro Thr Arg 
260 265 270 

AAA CCC CTG GCT CCT CCG CCC CAG CCC CCG GCC TCG CCC ACA CAC AGC 12 6 6 

15 Lys Pro Leu Ala Pro Pro Pro Gin Pro Pro Ala Ser Pro Thr His Ser 

275 280 285 

CCA TCC TTC CCC ATC CCT GAT CGA TGT GAG GGC AAT TTT GAC GCC ATC 1314 
Pro Ser Phe Pro lie Pro Asp Arg Cys Glu Gly Asn Phe Asp Ala lie 
20 290 295 300 

GCC AAC ATC CGA GGG GAA ACT TTC TTC TTC AAA GGC CCC TGG TTC TGG 13 6 2 

Ala Asn lie Arg Gly Glu Thr Phe Phe Phe Lys Gly Pro Trp Phe Trp 
305 310 315 



25 



45 



CGC CTC CAG CCC TCC GGA CAG CTG GTG TCC CCG CGA CCC GCA CGG CTG 1410 
Arg Leu Gin Pro Ser Gly Gin Leu Val Ser Pro Arg Pro Ala Arg Leu 
320 325 -330 335 



3 0 CAC CGC TTC TGG GAG GGG CTG CCC GCC CAG GTG AGG GTG GTG CAG GCC 145 8, 

His Arg Phe Trp Glu Gly Leu Pro Ala Gin Val Arg Val Val Gin Ala 
340 345 350 

GCC TAT GCT CGG CAC CGA GAC GGC CGA ATC CTC CTC TTT AGC GGG CCC 150 6 

35 Ala Tyr Ala Arg His Arg Asp Gly Arg lie Leu Leu Phe Ser Gly Pro 

355 360 365 

CAG TTC TGG GTG TTC CAG GAC CGG CAG CTG GAG GGC GGG GCG CGG CCG 155 4 

Gin Phe Trp Val Phe Gin Asp Arg Gin Leu Glu Gly Gly Ala Arg Pro 
40 370 375 380 

CTC ACG GAG CTG GGG CTG CCC CCG GGA GAG GAG GTG GAC GCC GTG TTC 1602 
Leu Thr Glu Leu Gly Leu Pro Pro Gly Glu Glu Val Asp Ala Val Phe 
385 390 395 



TCG TGG CCA CAG AAC GGG AAG ACC TAC CTG GTC CGC GGC CGG CAG TAC 1650 
Ser Trp Pro Gin Asn Gly Lys Thr Tyr Leu Val Arg Gly Arg Gin Tyr 
400 405 410 415 



50 TGG CGC TAC GAC GAG GCG GCG GCG CGC CCG GAC CCC GGC TAC CCT CGC 169 8 

Trp Arg Tyr Asp Glu Ala Ala Ala Arg Pro Asp Pro Gly Tyr Pro Arg 
420 425 430 

GAC CTG AGC CTC TGG GAA GGC GCG CCC CCC TCC CCT GAC GAT GTC ACC 17 4 6 

55 Asp Leu Ser Leu Trp Glu Gly Ala Pro Pro Ser Pro Asp Asp Val Thr 

435 440 445 

GTC AGC AAC GCA GGT GAC ACC TAC TTC TTC AAG . GGC GCC CAC TAC TGG 17 9 4 

Val Ser Asn Ala Gly Asp Thr Tyr Phe Phe Lys Gly Ala His Tyr Trp 
60 450 455 460 - 

CGC TTC CCC AAG AAC AGC ATC AAG ACC GAG CCG GAC GCC CCC CAG CCC 1842 
Arg Phe Pro Lys Asn Ser lie Lys Thr Glu Pro Asp Ala Pro Gin Pro 
465 470 475 
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25 



ATG GGG CCC AAC TGG CTG GAC TGC CCC GCC CCG AGC TCT GGT CCC CGC 189 0 
Met Gly Pro Asn Trp Leu Asp Cys Pro Ala Pro Ser Ser Gly Pro Arg 
480 ~ 485 490 495 

GCC CCC AGG CCC CCC AAA GCG ACC CCC GTG TCC GAA ACC TGC GAT TGT 193 8 
Ala Pro Arg Pro Pro Lys Ala Thr Pro Val Ser Glu Thr Cys Asp Cys 
500 505 510 

10 CAG TGC GAG CTC AAC CAG GCC GCA GGA CGT TGG CCT GCT CCC ATC CCG 198 6 
Gin Cys Glu Leu Asn Gin Ala Ala Gly Arg Trp Pro Ala Pro He Pro 
515 520 525 

CTG CTC CTC TTG CCC CTG CTG GTG GGG GGT GTA GCC TCC CGC 2 02 8 
15 Leu Leu Leu Leu Pro Leu Leu Val Gly Gly Val Ala Ser Arg 
530 535 540 

TGATGGGGGG AGCCATCCAG AC CG AAC AGC GCCCTCCACG GCCGAGTCCC CCGCCGCTGG 20 8 8 

2 0 ACCTGGTCGG GGGTTGTGAG GCGCTGCGGA GGCCCCTTGT CTGTTCCCAC GGACGGGGGC 214 8 

TCGGGCGCGG ACTAAGCAGG GGGGATCTCC CGCGCAGGGG CGGCGGCGGC GGGGACCGGT 22 0 8 

CGCCTGGCGC TGGGCTCAGT CTCCTCAGGG TCTGAGACCC CGGCGCTGCC ACCGGAACCC 22 6 8 

GCCTTCAGGG GCGCACGCGC GCTGGGACCA TGCGTCGGTC GTCGCCCCCG TCGTTCCCTC 23 2 8 

CCGGCTGCCG CCAGGGGGCG GTCGGACCCC GCCTCCCGAG CCC GGGGAGG GGCGGGGAGG 2388 

3 0 ACAAGGGGCG GGCCCGCGGC CTCACCCGGA GGGACGGCAG CCCCGGTCGC GCGCTGGCCC 2448 

CGC AGG AC CT TCCTTTTCCA GGAAGAGCCA GCTTTTCTCG GAGCGCAGTC CTGGGACTCT 2508 

CCGCAGCCCC GCCCCGCCTG GCCACTGCGT CTGGCATTCC TGGGTCGTTA GAGGACAGGC 2568 

CTGACTGCGA AGCTGTGCCT TGCCCCTCTC CCACCCGCAG TTTCTCACCC CGTTCTGCTC 2 62 8 

CCACAAGGCC CCCCTACAGT CACTGCCACA CTGGTGGGGA CCTGGGACCC AGACCCGGAA 2 688 

40 CCAGCCCAGA TATCACCCCT GAGGACCCAT GCGCCACGTC CTGGGTGGTG GAATCAGTGG 27 48 

CTGGAGGGAC GACCCTTGCT CTCCAGGCTG TTAACCTTTT CCGTTGCTCC CCCGCCACCC 2 8 08 

ACCTCCTCCT CCCCAGGCCA CCCAACTTGG GCACCTCCCT GGGCCCAGAA CTGCCTTCCA 2 8 68 

TTCAATGGGG AACCCTTCTA TCCCCAAGAA CCCCTTCCCT GCTTGCACCC TGGAGAGAAC 29 2 8 

AGCTTGACTC CCATCAACTC AACGCTGGTG GAAAGACAGG GACCGAACCC TGGCTCAGGC 2 9 88 

50 CTGGTCATTG CCTCCTCAGC ACTCCCTCCT GGGAGGC CTT AGCTC TAG AG TGAGGGGTGG 3 04 8 

GTGGAAC CTG GGGGCACCTC GTTCACCCTG TCCCCACTCC C C AC AGTTTT AGGATCTAAA 310 8 

TGATTGCCTC TGGAACTATT C TTC TAG AC T ATCCCACATC AGAATCACTG GGAAATTTAA 3168 

GTTTG CAGAT CCCACACTCA CCCTGAATCC TCACTCAGGG TGGGGTCAGG AATCTGCATT 322 8 

TTAACTAGTC GCGGGGATTG TGGGGGGCAG TAGCTGGCTG TTTCGTGGCA TTTCTGTGGC 3288 

60 TCTGCAGTGT TCCTCCACCC CAGGACCAAT ATGTTCAGGC C AC AC CG ATG GCCTGAACCC 3 34 8 

CATGGGTAGA GTCACTTAGG GGCCACTTCC TAAGTTGCTG TCCAGCCTCA GTGACCCCCT 340 8 

AGTGCTTCCT GG AG C TG AGG CTGTGGGCGG CTGTCCCAGC AACCACGCGA GGGGTTGCCC 3 46 8 



35 



45 



55 
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10 



30 



45 



60 



CAGTTGCTCA TACAAACAGA TCAGCATGAG GACAGAAGGC AGGAGACTTT GGTCAGTTAC 3 52 8 

CTGGGAATTC TGGGCTGCCA GGAAACGATT TGGGCCTCTG TCAGTTTCTT TTCCATGTAT 3 58 3 

GAGGAGGGGG AAATTTGTAT ATTAGATACT TATTCATCCC ACTCTGGACA ATAAAAACGA 3 64 8 

ATGTACAAAA AAAACATAAA AAAAAAAAAT AAAGAAAATC AAA 3 691 

(2) INFORMATION FOR SEQ ID NO : 4 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 562 amino acids 
15 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Arg Leu Arg Leu Arg Leu Leu Ala Leu Leu Leu Leu Leu Leu Ala 
-21 -20 ~ -15 -10 

25 Pro Pro Ala Arg Ala Pro Lys Pro Ser Ala Gin Asp Val Ser Leu Gly 
-5 15 10 

Val Asp Trp Leu Thr Arg Tyr Gly Tyr Leu Pro Pro Pro His Pro Ala 
15 20 25 



Gin Ala Gin Leu Gin Ser Pro Glu Lys Leu Arg Asp Ala lie Lys Val 
30 35 40 



Met Gin Arg Phe Ala Gly Leu Pro Glu Thr Gly Arg Met Asp Pro Gly 
35 45 50 55 

Thr Val Ala Thr Met Arg Lys Pro Arg Cys Ser Leu Pro Asp Val Leu 
60 65 70 75 

40 Gly Val Ala Gly Leu Val Arg Arg Arg Arg Arg Tyr Ala Leu Ser Gly 

80 85 90 



Ser Val Trp Lys Lys Arg Thr Leu Thr Trp Arg Val Arg Ser Phe Pro 
95 100 105 

Gin Ser Ser Gin Leu Ser Gin Glu Thr Val Arg Val Leu Met Ser Tyr 
110 115 120 



Ala Leu Met Ala Trp Gly Met Glu Ser Gly Leu Thr Phe His Glu Val 
50 125 -130 135 

Asp Ser Pro Gin Gly Gin Glu Pro Asp lie Leu lie Asp Phe Ala Arg 
140 145 150 155 

55 Ala Phe His Gin Asp Ser Tyr Pro Phe Asp Gly Leu Gly Gly Thr Leu 

160 165 170 



Ala His Ala Phe Phe Pro Gly Glu His Pro lie Ser Gly Asp Thr His 

175 180 185 

Phe Asp Asp Glu Glu Thr Trp Thr Phe Gly Ser Lys Asp Gly Glu Gly 
190 195 200 

Thr Asp Leu Phe Ala Val Ala Val His Glu Phe Gly His Ala Leu Gly 
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205 210 215 

Leu Gly His Ser Ser Ala Pro Asn Ser lie Met Arg Pro Phe Tyr Gin 
220 225 230 235 

5 

Gly Pro Val Gly Asp Pro Asp Lys Tyr Arg Leu Ser Gin Asp Asp Arg 
240 245 250 

Asp Gly Leu Gin Gin Leu Tyr Gly Lys Ala Pro Gin Thr Pro Tyr Asp 
10 255 260 265 

Lys Pro Thr Arg Lys Pro Leu Ala Pro Pro Pro Gin Pro Pro Ala Ser 
270 275 280 

15 Pro Thr His Ser Pro Ser Phe Pro lie Pro Asp Arg Cys Glu Gly Asn 
285 290 295 



20 



35 



50 



Phe Asp Ala lie Ala Asn lie Arg Gly Glu Thr Phe Phe Phe Lys Gly 

300 305 310 315 

Pro Trp Phe Trp Arg Leu Gin Pro Ser. Gly Gin Leu .Val Ser - Pro Arg 

320 325 ■ 330 



Pro Ala Arg Leu His Arg Phe Trp Glu Gly Leu Pro Ala Gin Val Arg 
25 " 335 340 345 

Val Val Gin Ala Ala Tyr Ala Arg His Arg Asp Gly Arg lie Leu Leu 
350 ... 355 360 

3 0 Phe Ser Gly Pro Gin Phe Trp Val Phe Gin Asp Arg Gin Leu Glu Gly 

365 ~ 370 375 



Gly Ala Arg Pro Leu Thr Glu Leu Gly Leu Pro Pro Gly Glu Glu Val 

380 385 390 395 

Asp Ala Val Phe Ser Trp Pro Gin Asn Gly Lys Thr Tyr Leu Val Arg 
400 405 410 



Gly Arg Gin Tyr Trp Arg Tyr Asp Glu Ala Ala Ala Arg Pro Asp Pro 
40 415 420 425 

Gly Tyr Pro Arg Asp Leu Ser Leu Trp Glu Gly Ala Pro Pro Ser Pro 
430 435 440 

45 Asp Asp Val Thr Val Ser Asn Ala Gly Asp Thr Tyr Phe Phe Lys Gly 
445 450 455 



Ala His Tyr Trp Arg Phe Pro Lys Asn Ser He Lys Thr Glu Pro Asp 

460 465 470 475 

Ala Pro Gin Pro Met Gly Pro Asn Trp Leu Asp Cys Pro Ala Pro Ser 

480 485 490 



Ser Gly Pro Arg Ala Pro Arg Pro Pro Lys Ala Thr Pro Val Ser Glu 
55 ~ 495 500 505 

Thr Cys Asp Cys Gin Cys Glu Leu Asn Gin Ala Ala Gly. Arg Trp Pro 
510 515 520 

60 Ala Pro He Pro Leu Leu Leu Leu Pro Leu Leu Val Gly Gly Val Ala 
525 530 535 

Ser Arg 
540 
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J ( 2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single ^ - 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "primer 11 



15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
CAGAAATGCC ACGAAACAGC CAGGTACT 2 8 

20 (2) INFORMATION FOR SEQ ID- NO: 6: 

(i): 'SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) TYPE : nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



30 



35 



(ii) MOLECULE TYPE: other nucleic acid 
-(A) DESCRIPTION: /desc = "primer 1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
GCCCCAGTTG CTCATACAAA CAGATCAG 2 8 

(2) INFORMATION FOR SEQ ID NO : 7 : 



40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 519 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

45 

(ii) MOLECULE TYPE: peptide 



50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Met Gin Gin Phe Gly Gly Leu Glu Ala Thr Gly lie Leu Asp Glu Ala 
15 10 15 

55 

Thr Leu Ala Leu Met Lys Thr Pro Arg Cys Ser Leu Pro Asp Leu Pro 
20 25 30 

Val Leu Thr Gin Ala Arg Arg Arg Arg Gin Ala Pro Ala Pro Thr Lys 
60 35 40 45 

Trp Asn Lys Arg Asn Leu Ser Trp Arg Val Arg Thr Phe Pro Arg Asp 
50 55 60 
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Ser Pro Leu Gly His Asp Thr Val Arg Ala Leu Met Tyr Tyr Ala Leu 

65 70 75 ~ 80 

Lys Val Trp Ser Asp lie Ala Pro Leu Asn Phe His Glu Val Ala Gly 
5 85 90 95 

Ser Thr Ala Asp lie Gin lie Asp Phe Ser Lys Ala Asp His Asn Asp 
100 105 ' no 

10 Gly Tyr Pro Phe Asp Gly Pro Gly Gly Thr Val Ala His Ala Phe Phe 

115 120 125 



15 



30 



45 



60 



Pro Gly His His His Thr Ala Gly Asp Thr His Phe Asp Asp Asp Glu 

130 135 140 

Ala Trp Thr Phe Arg Ser Ser Asp Ala His Gly Met Asp Leu Phe Ala 

145 150 155 160 



Val Ala Val His Glu Phe Gly His Ala lie Gly Leu Ser His Val Ala 
20 165 170 175 

Ala Ala His Ser lie Met Arg Pro Tyr Tyr Gin Gly Pro Val Gly Asp 
180 185 190 

25 Pro Leu Arg Tyr Gly Leu Pro Tyr Glu Asp Lys Val Arg Val Trp Gin 

195 200 205 



Leu Tyr Gly Val Arg Glu Ser Val Ser Pro Thr Ala Gin Pro Glu Glu 
210 215 220 

Pro Pro Leu Leu Pro Glu Pro Pro Asp Asn Arg Ser Ser Ala Pro Pro 

225 230 235 240 



Arg Lys Asp Val Pro His Arg Cys Ser Thr His Phe Asp Ala Val Ala 

35 245 250 255 

Gin lie Arg Gly Glu Ala Phe Phe Phe Lys Gly Lys Tyr Phe Trp Arg 

260 265 270 

40 Leu Thr Arg Asp Arg His Leu Val Ser Leu Gin Pro Ala Gin Met His 

275 280 285 



Arg Phe Trp Arg Gly Leu Pro Leu- His Leu Asp Ser Val Asp Ala Val 

290 295 300 

Tyr Glu Arg Thr Ser Asp His Lys lie Val Phe Phe Lys Gly Asp Arg 

305 310 315 " 320 



Tyr Trp Val Phe Lys Asp Asn Asn Val Glu Glu Gly Tyr Pro Arg Pro 
50 325 330 335 

Val Ser Asp Phe Ser Leu Pro Pro Gly Gly lie Asp Ala Ala Phe Ser 
340 345 350 

55 Trp Ala His Asn Asp Arg Thr Tyr Phe Phe Lys Asp Gin Leu Tyr Trp 

355 360 365 



Arg Tyr Asp Asp His Thr Arg His Met Asp Pro Gly Tyr Pro Ala Gin 
370 375 380 

Ser Pro Leu Trp Arg Gly Val Pro Ser Thr Leu Asp Asp Ala Met Arg 

385 390 395 " 400 

Trp Ser Asp Gly Ala Ser Tyr Phe Phe Arg Gly Gin Glu Tyr Trp Lys 
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405 410 . 415 

Val Leu Asp Gly Glu Leu Glu Val Ala Pro Gly Tyr Pro Gin Ser Thr 
420 425 430 

5 

Ala Arg Asp Trp Leu Val Cys Gly Asp Ser Gin Ala Asp Gly Ser Val 
435 440 445 

Ala Ala Gly Val Asp Ala Ala Glu Gly Pro Arg Ala Pro Pro Gly Gin 
10 450 J 455 460 

His Asp Gin Ser Arg Ser Glu Asp Gly Tyr Glu Val Cys Ser Cys Thr 
465 470 475 " 480 

15 Ser Gly Ala Ser Ser Pro Pro Gly Ala Pro Gly Pro Leu Val Ala Ala 

485 490 495 



20 



25 



35 



50 



Thr Met Leu Leu Leu Leu Pro Pro Leu Ser Pro Gly Ala Leu Trp Thr 
500 505 510 

Ala Ala Gin Ala Leu Thr Leu 
515 

(2) INFORMATION FOR SEQ ID NO : 8 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 64 amino acids 
<B) TYPE: amino acid 
. (C) STRANDEDNESS: not relevant 
3 0 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



Met Lys Arg Pro Arg Cys Gly Val Pro Asp Gin Phe Gly Val Arg VaJL 
40 1 5 10 15 

Lys Ala Asn Leu Arg Arg Arg Arg ' Lys Arg Tyr Ala Leu Thr Gly Arg 
20 25 30 

45 Lys Trp Asn Asn His His Leu Thr Phe Ser lie Gin Asn Tyr Thr Glu 

35 40 45 

Lys Leu Gly Trp Tyr His Ser Met Glu Ala Val Arg Arg Ala Phe Arg 
50 "* 55 60 



Val Trp Glu Gin Ala Thr Pro Leu Val Phe Gin Glu Val Pro Tyr Glu 
65 " 70 75 80\ 



Aso lie Arg Leu Arg Arg Gin Lys Glu Ala Asp lie Met Val Leu Phe 
55 " 85 90 95 

Ala Ser Gly Phe His Gly Asp Ser Ser Pro Phe Asp Gly Thr Gly Gly 
100 105 110 

60 Phe Leu Ala His Ala Tyr Phe Pro Gly Pro Gly Leu Gly Gly Asp Thr 

115 120 125 

His Phe Asp Ala Asp Glu Pro Trp Thr Phe Ser Ser Thr Asp Leu His 
130 135 140 
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10 



25 



40 



55 



Gly Asn Asn Leu Phe. Leu Val Ala Val His Glu Leu Gly His Ala Leu 
145 150 155 160 

Gly Leu Glu His Ser Ser Asn Pro Asn Ala lie Met Ala Pro Phe Tyr 
165 170 175 

Gin Trp Lys Asp Val Asp Asn Phe Lys Leu Pro Glu Asp Asp Leu Arg 
180 185 190 

Gly lie Gin Gin Leu Tyr Gly Thr Pro Asp Gly Gin Pro Gin Pro Thr 
195 200 205 



Gin Pro Leu Pro Thr Val Thr Pro Arg Arg Pro Gly Arg Pro Asp His 

15 210 215 220 

Arg Pro Pro Arg Pro Pro Gin Pro Pro Pro Pro Gly Gly Lys Pro Glu 

225 230 235 240 

20 Arg Pro Pro Lys Pro Gly Pro Pro Val Gin Pro Arg Ala Thr Glu Arg 

245 " 250 255 



Pro Asp Gin Tyr Gly Pro Asn lie Cys Asp Gly Asp Phe Asp Thr Val 
260 265 270 

Ala Met Leu Arg Gly Glu Met Phe Val Phe Lys Gly Arg Trp Phe Trp 

275 280 285 



Arg Val Arg His Asn Arg Val Leu Asp Asn Tyr Pro Met Pro lie Gly 
30 290 295 300- 

His Phe Trp Arg Gly Leu Pro Gly Asp He Ser Ala Ala Tyr Glu Arg 
305 " ~ 310 315 320 

3 5 Gin Asp Gly Arg Phe Val Phe Phe Lys Gly Asp Arg Tyr Trp Leu Phe 

325 330 335 



Arg Glu Ala Asn Leu Glu Pro Gly Tyr Pro Gin Pro Leu Thr Ser Tyr 
340 345 350 

Gly Leu Gly He Pro Tyr Asp Arg He Asp Thr Ala He Trp Trp Glu 
355 360 365 



Pro Thr Gly His Thr Phe Phe Phe Gin Glu Asp Arg Tyr Trp Arg Phe 
45 370 375 380 

Asn Glu Glu Thr Gin Arg Gly Asp Pro Gly Tyr Pro Lys Pro He Ser 
385 390 395 400 

50 Val Trp Gin Gly He Pro Ala Ser Pro Lys Gly Ala Phe Leu Ser Asn 

405 410 415 



Asp Ala Ala Tyr Thr Tyr Phe Tyr Lys Gly Thr Lys Tyr Trp Lys Phe 
420 425 430 

Asp Asn Glu Arg Leu Arg Met Glu Pro Gly Tyr Pro Lys Ser He Leu 
435 440 445 



Arg Asp Phe Met Gly Cys Gin Glu His Val Glu Pro Gly Pro Arg Trp 

60 450 " 455 460 

Pro Asp Val Ala Arg Pro Pro Phe Asn Pro His Gly Gly Ala Glu Pro 
465 " 470 475 480 
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Gly Ala Asp Ser Ala Glu Gly Asp Val Gly Asp Gly Asp Gly Asp Phe 
485 490 495 

Gly Ala Gly Val Asn Lys Asp Arg Gly Ser Arg Val Val Val Gin Met 
5 500 505 510 

Glu Glu Val Ala Arg Thr Val Asn Val Val Met Val Leu Val Pro Leu 
515 520 525 

10 Leu Leu Leu Leu Cys Val Leu Gly Leu Thr Tyr Ala Leu Val Gin Met 

530 535 540 

Gin Arg Lys Gly Ala Pro Arg Val Leu Leu Tyr Cys Lys Arg Ser Leu 
545 550 555 " 560 

15 

Gin Glu Trp Val 



20 



(2) INFORMATION FOR SEQ ID NO : 9 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 82 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 
25 (D) TOPOLOGY: linear 



30 



45 



60 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 



Met Ser Pro Ala Pro Arg Pro Ser Arg Cys Leu Leu Leu Pro Leu Leu 
35 1 5 10 15 

Thr Leu Gly Thr Ala Leu Ala Ser Leu Gly Ser Ala Gin Ser Ser Ser 
20 25 30 

40 Phe Ser Pro Glu Ala Trp Leu Gin Gin Tyr Gly Tyr Leu Pro Pro Gly 

35 40 45 



Asp Leu Arg Thr His Thr Gin Arg Ser Pro Gin Ser Leu Ser Ala Ala 
50 55 60 

He Ala Ala Met Gin Lys Phe Tyr Gly Leu Gin Val Thr Gly Lys Ala 
65 70 75 80 



Asp Ala Asp Thr Met Lys Ala Met Arg Arg Pro Arg Cys Gly Val Pro 
50 85 90 95 

Asp Lys Phe Gly Ala Glu He Lys Ala Asn Val Arg Arg Lys Arg Tyr 
100 105 110 

55 Ala He Gin Gly Leu Lys Trp Gin His Asn Glu He Thr Phe Cys He 

115 120 125 



Gin Asn Tyr Thr Pro Lys Val Gly Glu Tyr Ala Thr Tyr Glu Ala He 

130 135 140 

Arg Lys Ala Phe Arg Val Trp Glu Ser Ala Thr Pro Leu Arg Phe Arg 

145 ~ 150 155 160 

Glu Val Pro Tyr Ala Tyr He Arg Glu Gly His Glu Lys Gin Ala Asp 
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165 170 175 

lie Met lie Phe Phe Ala Glu Gly Phe His Gly Asp Ser Thr Pro Phe 
180 185 190 

5 

Asp Gly Glu Gly Gly Phe Leu Ala His Ala Tyr Phe Pro Gly Pro Asn 
195 200 205 

lie Gly Gly Asp Thr His Phe Asp Ser Ala Glu Pro Trp Thr Val Arg 
10 210 ~ 215 220 

Asn Glu Asp Leu Asn Gly Asn Asp He Phe Leu Val Ala Val His Glu 
225 230 235 240 

15 Leu Gly His Ala Leu Gly Leu Glu His Ser Ser Asp Pro Ser Ala He 

245 250 255 



20 



35 



50 



Met Ala Pro Phe Tyr Gin Trp Met Asp Thr Glu Asn Phe Val Leu Pro 
260 265 270 

Asp Asp Asp Arg Arg Gly He Gin Gin Leu Tyr Gly Gly Glu Ser Gly 
275 280 285 



Phe Pro Thr Lys Met Pro Pro Gin Pro Arg Thr Thr Ser Arg Pro Ser 
25 290 " 295 300 

Val Pro Asp Lys Pro Lys Asn Pro Thr Tyr Gly Pro Asn He Cys Asp 
305 ~ 310 315 320 

3 0 Gly Asn Phe Asp Thr Val Ala Met Leu Arg Gly Glu -Met Phe Val Phe 

325 330 335 



Lys Glu Arg Trp Phe Trp Arg Val Arg Asn Asn Gin Val Met Asp Gly 

340 345 350 

Tyr Pro Met . Pro He Gly Gin Phe Trp Arg Gly Leu Pro Ala Ser He 
355 360 365 



Asn Thr Ala Tyr Glu Arg Lys Asp Gly Lys Phe Val Phe Phe Lys Gly 
40 370 375 380 

Asp Lys His Trp Val Phe Asp Glu Ala Ser Leu Glu Pro Gly Tyr Pro 
385 390 395 400 

45 Lys His He Lys Glu Leu Gly Arg Gly Leu Pro Thr Asp Lys He Asp 

405 410 415 



Ala Ala Leu Phe Trp Met Pro Asn Gly Lys Thr Tyr Phe Phe Arg Gly 
420 425 430 

Asn Lys Tyr Tyr Arg Phe Asn Glu Glu Leu Arg Ala Val Asp Ser Glu 
435 440 445 



Tyr Pro Lys Asn He Lys Val Trp Glu Gly He Pro Glu Ser Pro Arg 
55 450 455 460 

Gly Ser Phe Met Gly Ser Asp Glu Val Phe Thr Tyr Phe Tyr Lys Gly 
465 470 475 480 

60 Asn Lys Tyr Trp Lys Phe Asn Asn Gin Lys Leu Lys Val Glu Pro Gly 

485 490 495 

- Tyr Pro Lys Ser Ala Leu Arg Asp Trp Met Gly Cys Pro Ser Gly Gly 
500 505 510 
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10 



40 



55 



Arg Pro Asp Glu Gly Thr Glu Glu Glu Thr Glu Val He He He Glu 
515 520 525 

Val Asp Glu Glu Gly Gly Gly Ala Val Ser Ala Ala Ala Val Val Leu 
530 535 540 

Pro Val Leu Leu Leu Leu Leu Val Leu Ala Val Gly Leu Ala Val Phe 
545 550 555 560 

Phe Phe Arg Arg His Gly Thr Pro Arg Arg Leu Leu Tyr Cys Gin Arg 
565 570 575 



Ser Leu Leu Asp Lys Val 
1.5 580 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
2 0 (A) LENGTH: 607 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: peptide 



3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met He Leu Leu Thr Phe Ser Thr Gly Arg Arg Leu Asp Phe Val His 
15 10 15 

3 5 His Ser Gly Val Phe Phe Leu Gin Thr Leu Leu Trp He Leu Cys Ala 

20 25 30 

Thr Val Cys Gly Thr Glu Gin Tyr Phe Asn Val Glu Val Trp Leu Gin 
35 " 40 45 

Lys Tyr Gly Tyr Leu Pro Pro Thr Asp Pro Arg Met Ser Val Leu Arg 
50 " 55 60 



Ser Ala Glu Thr Met Gin Ser Ala Leu Ala Ala Met Gin Gin Phe Tyr 
45 65 70 75 80 

Gly He Asn Met Thr Gly Lys Val Asp Arg Asn Thr He Asp Trp Met 
85 90 95 

50 Lys Lys Pro Arg Cys Gly Val Pro Asp Gin Thr Arg Gly Ser Ser Lys 

100 105 110 



Phe His He Arg Arg Lys Arg Tyr Ala Leu Thr Gly Gin Lys Trp Gin 
115 120 125 

His Lys His lie Thr Tyr Ser He Lys Asn Val Thr Pro Lys Val Gly 
130 135 140 



Asp Pro Glu Thr Arg Lys Ala He Arg Arg Ala Phe Asp Val Trp Gin 
60 145 150 155 160 

Asn Val Thr Pro Leu Thr Phe Glu Glu Val Pro Tyr Ser Glu Leu Glu 
165 170 175 . 



15 
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Asn Gly Lys Arg Asp Val Asp lie Thr lie lie Phe Ala Ser Gly Phe 
180 185 190 

His Gly Asp Ser Ser Pro Phe Asp Gly Glu Gly Gly Phe Leu Ala His 
5 195 200 205 

Ala Tyr Phe Pro Gly Pro Gly lie Gly Gly Asp Thr His Phe Asp Ser 
210 " 215 220 

10 Asp Glu Pro Trp Thr Leu Gly Asn Pro Asn His Asp- Gly Asn Asp Leu 

225 230 235 240 

Phe Leu Val Ala Val His Glu Leu Gly His Ala Leu Gly Leu Glu His 
245 250 255 

Ser Asn Asp Pro Thr Ala lie Met Ala Pro Phe Tyr Gin Tyr Met Glu 
260 265 270 

Thr Asp Asn Phe Lys Leu Pro Asn Asp Asp Leu Gin Gly lie Gin Lys 
20 275 280 285 

lie Tyr Gly Pro Pro Asp Lys He Pro Pro Pro Thr Arg Pro Leu Pro 
290 ~ 295 300 

25 Thr Val Pro Pro His Arg Ser He Pro Pro Ala Asp Pro Arg Lys Asn 

305 310 315 320 

Asp Arg Pro Lys Pro Pro Arg Pro Pro Thr Gly Arg Pro Ser Tyr Pro 
325 330 335 

3 0 

Gly Ala Lys Pro Asn He Cys Asp Gly Asn Phe Asn Thr Leu Ala He 
340 345 350 

Leu Arg Arg Glu Met Phe Val Phe Lys Asp Gin Trp Phe Trp Arg Val 
35 355 360 365 

Arg Asn Asn Arg Val Met Asp Gly Tyr Pro Met Gin He Thr Tyr Phe 
370 375 380 

40 Trp Arg Gly Leu Pro Pro Ser He Asp Ala Val Tyr Glu Asn Ser Asp 

385 " ~ 390 395 400 

Gly Asn Phe Val Phe Phe Lys Gly Asn Lys Tyr Trp Val Phe Lys Asp 
405 410 415 

45 

Thr Thr Leu Gin Pro Gly Tyr Pro His Asp Leu He Thr Leu Gly Ser 
420 425 430 

Gly He Pro Pro His Gly He Asp Ser Ala He Trp Trp Glu Asp Val 
50 " 435 440 445 

Gly Lys Thr Tyr Phe Phe Lys Gly Asp Arg Tyr Trp Arg Tyr Ser Glu 
450 455 460 

55 Glu Met Lys Thr Met Asp Pro Gly Tyr Pro Lys Pro He Thr Val Trp 

465 "* 470 475 480 

Lvs Gly He Pro Glu Ser Pro Gin Gly Ala Phe Val His Lys Glu Asn 
485 490 495 

60 Glv phe Thr Tyr Phe Tyr Lys Gly Lys Glu Tyr Trp Lys Phe Asn Asn 

500 505 510 

Gin He Leu Lys Val Glu Pro Gly Tyr Pro Arg Ser He Leu Lys Asp 
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515 520 



525 



Phe Met Gly Cys Asp Gly Pro Thr Asp Arg Val Lys Glu QTy His Ser 
5 530 535 54 0 

Pro Pro Asp Asp Val Asp He Val He Lys Leu Asp Asn Thr Ala Ser 
545 550 555 550 

Thr Val Lys Ala lie Ala He Val He Pro Cys He Leu Ala Leu Cys 
U 565 570 575 

Leu Leu Val Leu Val Tyr Thr Val Phe Gin Phe Lys Arg Lys Gly Thr 
580 585 590 



Pro Arg His He Leu Tyr Cys Lys Arg Ser Met Gin Glu T r o Val 
595 600 605 
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